Overview

Dataset Statistics

Number of Variables 122
Number of Rows 307511
Missing Cells 9.1525e+06
Missing Cells (%) 24.4%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 536.7 MB
Average Row Size in Memory 1.8 KB
Variable Types
  • Numerical: 70
  • Categorical: 52
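
The missing-cell percentage follows from the three counts reported with it. A minimal check in Python, using only the figures listed above (the variable names are illustrative, and the missing-cell count is taken in its rounded scientific-notation form):

```python
# Figures copied from the Dataset Statistics listing.
n_rows = 307511
n_variables = 122
missing_cells = 9.1525e06  # reported in rounded scientific notation

# Every row contributes one cell per variable.
total_cells = n_rows * n_variables
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Roughly one cell in four is empty, which is why most of the insights that follow concern missing values.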

Dataset Insights

APARTMENTS_AVG and APARTMENTS_MODE have similar distributions Similar Distribution
APARTMENTS_AVG and APARTMENTS_MEDI have similar distributions Similar Distribution
BASEMENTAREA_AVG and BASEMENTAREA_MODE have similar distributions Similar Distribution
BASEMENTAREA_AVG and BASEMENTAREA_MEDI have similar distributions Similar Distribution
YEARS_BEGINEXPLUATATION_AVG and YEARS_BEGINEXPLUATATION_MODE have similar distributions Similar Distribution
YEARS_BEGINEXPLUATATION_AVG and YEARS_BEGINEXPLUATATION_MEDI have similar distributions Similar Distribution
YEARS_BUILD_AVG and YEARS_BUILD_MODE have similar distributions Similar Distribution
YEARS_BUILD_AVG and YEARS_BUILD_MEDI have similar distributions Similar Distribution
COMMONAREA_AVG and COMMONAREA_MODE have similar distributions Similar Distribution
COMMONAREA_AVG and COMMONAREA_MEDI have similar distributions Similar Distribution
ELEVATORS_AVG and ELEVATORS_MEDI have similar distributions Similar Distribution
ENTRANCES_AVG and ENTRANCES_MODE have similar distributions Similar Distribution
ENTRANCES_AVG and ENTRANCES_MEDI have similar distributions Similar Distribution
FLOORSMAX_AVG and FLOORSMAX_MODE have similar distributions Similar Distribution
FLOORSMAX_AVG and FLOORSMAX_MEDI have similar distributions Similar Distribution
FLOORSMIN_AVG and FLOORSMIN_MODE have similar distributions Similar Distribution
FLOORSMIN_AVG and FLOORSMIN_MEDI have similar distributions Similar Distribution
LANDAREA_AVG and LANDAREA_MODE have similar distributions Similar Distribution
LANDAREA_AVG and LANDAREA_MEDI have similar distributions Similar Distribution
LIVINGAPARTMENTS_AVG and LIVINGAPARTMENTS_MODE have similar distributions Similar Distribution
LIVINGAPARTMENTS_AVG and LIVINGAPARTMENTS_MEDI have similar distributions Similar Distribution
LIVINGAREA_AVG and LIVINGAREA_MODE have similar distributions Similar Distribution
LIVINGAREA_AVG and LIVINGAREA_MEDI have similar distributions Similar Distribution
NONLIVINGAPARTMENTS_AVG and NONLIVINGAPARTMENTS_MODE have similar distributions Similar Distribution
NONLIVINGAPARTMENTS_AVG and NONLIVINGAPARTMENTS_MEDI have similar distributions Similar Distribution
NONLIVINGAREA_AVG and NONLIVINGAREA_MEDI have similar distributions Similar Distribution
APARTMENTS_MODE and APARTMENTS_MEDI have similar distributions Similar Distribution
BASEMENTAREA_MODE and BASEMENTAREA_MEDI have similar distributions Similar Distribution
YEARS_BEGINEXPLUATATION_MODE and YEARS_BEGINEXPLUATATION_MEDI have similar distributions Similar Distribution
YEARS_BUILD_MODE and YEARS_BUILD_MEDI have similar distributions Similar Distribution
COMMONAREA_MODE and COMMONAREA_MEDI have similar distributions Similar Distribution
ENTRANCES_MODE and ENTRANCES_MEDI have similar distributions Similar Distribution
FLOORSMAX_MODE and FLOORSMAX_MEDI have similar distributions Similar Distribution
FLOORSMIN_MODE and FLOORSMIN_MEDI have similar distributions Similar Distribution
LANDAREA_MODE and LANDAREA_MEDI have similar distributions Similar Distribution
LIVINGAPARTMENTS_MODE and LIVINGAPARTMENTS_MEDI have similar distributions Similar Distribution
LIVINGAREA_MODE and LIVINGAREA_MEDI have similar distributions Similar Distribution
NONLIVINGAPARTMENTS_MODE and NONLIVINGAPARTMENTS_MEDI have similar distributions Similar Distribution
NONLIVINGAREA_MODE and NONLIVINGAREA_MEDI have similar distributions Similar Distribution
OBS_30_CNT_SOCIAL_CIRCLE and OBS_60_CNT_SOCIAL_CIRCLE have similar distributions Similar Distribution
AMT_REQ_CREDIT_BUREAU_DAY and AMT_REQ_CREDIT_BUREAU_WEEK have similar distributions Similar Distribution
AMT_REQ_CREDIT_BUREAU_MON and AMT_REQ_CREDIT_BUREAU_QRT have similar distributions Similar Distribution
OWN_CAR_AGE has 202929 (65.99%) missing values Missing
OCCUPATION_TYPE has 96391 (31.35%) missing values Missing
EXT_SOURCE_1 has 173378 (56.38%) missing values Missing
EXT_SOURCE_3 has 60965 (19.83%) missing values Missing
APARTMENTS_AVG has 156061 (50.75%) missing values Missing
BASEMENTAREA_AVG has 179943 (58.52%) missing values Missing
YEARS_BEGINEXPLUATATION_AVG has 150007 (48.78%) missing values Missing
YEARS_BUILD_AVG has 204488 (66.5%) missing values Missing
COMMONAREA_AVG has 214865 (69.87%) missing values Missing
ELEVATORS_AVG has 163891 (53.3%) missing values Missing
ENTRANCES_AVG has 154828 (50.35%) missing values Missing
FLOORSMAX_AVG has 153020 (49.76%) missing values Missing
FLOORSMIN_AVG has 208642 (67.85%) missing values Missing
LANDAREA_AVG has 182590 (59.38%) missing values Missing
LIVINGAPARTMENTS_AVG has 210199 (68.35%) missing values Missing
LIVINGAREA_AVG has 154350 (50.19%) missing values Missing
NONLIVINGAPARTMENTS_AVG has 213514 (69.43%) missing values Missing
NONLIVINGAREA_AVG has 169682 (55.18%) missing values Missing
APARTMENTS_MODE has 156061 (50.75%) missing values Missing
BASEMENTAREA_MODE has 179943 (58.52%) missing values Missing
YEARS_BEGINEXPLUATATION_MODE has 150007 (48.78%) missing values Missing
YEARS_BUILD_MODE has 204488 (66.5%) missing values Missing
COMMONAREA_MODE has 214865 (69.87%) missing values Missing
ELEVATORS_MODE has 163891 (53.3%) missing values Missing
ENTRANCES_MODE has 154828 (50.35%) missing values Missing
FLOORSMAX_MODE has 153020 (49.76%) missing values Missing
FLOORSMIN_MODE has 208642 (67.85%) missing values Missing
LANDAREA_MODE has 182590 (59.38%) missing values Missing
LIVINGAPARTMENTS_MODE has 210199 (68.35%) missing values Missing
LIVINGAREA_MODE has 154350 (50.19%) missing values Missing
NONLIVINGAPARTMENTS_MODE has 213514 (69.43%) missing values Missing
NONLIVINGAREA_MODE has 169682 (55.18%) missing values Missing
APARTMENTS_MEDI has 156061 (50.75%) missing values Missing
BASEMENTAREA_MEDI has 179943 (58.52%) missing values Missing
YEARS_BEGINEXPLUATATION_MEDI has 150007 (48.78%) missing values Missing
YEARS_BUILD_MEDI has 204488 (66.5%) missing values Missing
COMMONAREA_MEDI has 214865 (69.87%) missing values Missing
ELEVATORS_MEDI has 163891 (53.3%) missing values Missing
ENTRANCES_MEDI has 154828 (50.35%) missing values Missing
FLOORSMAX_MEDI has 153020 (49.76%) missing values Missing
FLOORSMIN_MEDI has 208642 (67.85%) missing values Missing
LANDAREA_MEDI has 182590 (59.38%) missing values Missing
LIVINGAPARTMENTS_MEDI has 210199 (68.35%) missing values Missing
LIVINGAREA_MEDI has 154350 (50.19%) missing values Missing
NONLIVINGAPARTMENTS_MEDI has 213514 (69.43%) missing values Missing
NONLIVINGAREA_MEDI has 169682 (55.18%) missing values Missing
FONDKAPREMONT_MODE has 210295 (68.39%) missing values Missing
HOUSETYPE_MODE has 154297 (50.18%) missing values Missing
TOTALAREA_MODE has 148431 (48.27%) missing values Missing
WALLSMATERIAL_MODE has 156341 (50.84%) missing values Missing
EMERGENCYSTATE_MODE has 145755 (47.4%) missing values Missing
AMT_REQ_CREDIT_BUREAU_HOUR has 41519 (13.5%) missing values Missing
AMT_REQ_CREDIT_BUREAU_DAY has 41519 (13.5%) missing values Missing
AMT_REQ_CREDIT_BUREAU_WEEK has 41519 (13.5%) missing values Missing
AMT_REQ_CREDIT_BUREAU_MON has 41519 (13.5%) missing values Missing
AMT_REQ_CREDIT_BUREAU_QRT has 41519 (13.5%) missing values Missing
AMT_REQ_CREDIT_BUREAU_YEAR has 41519 (13.5%) missing values Missing
CNT_CHILDREN is skewed Skewed
AMT_INCOME_TOTAL is skewed Skewed
AMT_CREDIT is skewed Skewed
AMT_ANNUITY is skewed Skewed
AMT_GOODS_PRICE is skewed Skewed
DAYS_EMPLOYED is skewed Skewed
CNT_FAM_MEMBERS is skewed Skewed
APARTMENTS_AVG is skewed Skewed
BASEMENTAREA_AVG is skewed Skewed
YEARS_BEGINEXPLUATATION_AVG is skewed Skewed
COMMONAREA_AVG is skewed Skewed
ELEVATORS_AVG is skewed Skewed
ENTRANCES_AVG is skewed Skewed
FLOORSMAX_AVG is skewed Skewed
FLOORSMIN_AVG is skewed Skewed
LANDAREA_AVG is skewed Skewed
LIVINGAPARTMENTS_AVG is skewed Skewed
LIVINGAREA_AVG is skewed Skewed
NONLIVINGAPARTMENTS_AVG is skewed Skewed
NONLIVINGAREA_AVG is skewed Skewed
APARTMENTS_MODE is skewed Skewed
BASEMENTAREA_MODE is skewed Skewed
YEARS_BEGINEXPLUATATION_MODE is skewed Skewed
COMMONAREA_MODE is skewed Skewed
ELEVATORS_MODE is skewed Skewed
ENTRANCES_MODE is skewed Skewed
FLOORSMAX_MODE is skewed Skewed
FLOORSMIN_MODE is skewed Skewed
LANDAREA_MODE is skewed Skewed
LIVINGAPARTMENTS_MODE is skewed Skewed
LIVINGAREA_MODE is skewed Skewed
NONLIVINGAPARTMENTS_MODE is skewed Skewed
NONLIVINGAREA_MODE is skewed Skewed
APARTMENTS_MEDI is skewed Skewed
BASEMENTAREA_MEDI is skewed Skewed
YEARS_BEGINEXPLUATATION_MEDI is skewed Skewed
COMMONAREA_MEDI is skewed Skewed
ELEVATORS_MEDI is skewed Skewed
ENTRANCES_MEDI is skewed Skewed
FLOORSMAX_MEDI is skewed Skewed
FLOORSMIN_MEDI is skewed Skewed
LANDAREA_MEDI is skewed Skewed
LIVINGAPARTMENTS_MEDI is skewed Skewed
LIVINGAREA_MEDI is skewed Skewed
NONLIVINGAPARTMENTS_MEDI is skewed Skewed
NONLIVINGAREA_MEDI is skewed Skewed
TOTALAREA_MODE is skewed Skewed
OBS_30_CNT_SOCIAL_CIRCLE is skewed Skewed
DEF_30_CNT_SOCIAL_CIRCLE is skewed Skewed
OBS_60_CNT_SOCIAL_CIRCLE is skewed Skewed
DEF_60_CNT_SOCIAL_CIRCLE is skewed Skewed
DAYS_LAST_PHONE_CHANGE is skewed Skewed
AMT_REQ_CREDIT_BUREAU_DAY is skewed Skewed
AMT_REQ_CREDIT_BUREAU_WEEK is skewed Skewed
AMT_REQ_CREDIT_BUREAU_MON is skewed Skewed
AMT_REQ_CREDIT_BUREAU_QRT is skewed Skewed
AMT_REQ_CREDIT_BUREAU_YEAR is skewed Skewed
ORGANIZATION_TYPE has a high cardinality: 58 distinct values High Cardinality
TARGET has constant length 1 Constant Length
FLAG_OWN_CAR has constant length 1 Constant Length
FLAG_OWN_REALTY has constant length 1 Constant Length
FLAG_MOBIL has constant length 1 Constant Length
FLAG_EMP_PHONE has constant length 1 Constant Length
FLAG_WORK_PHONE has constant length 1 Constant Length
FLAG_CONT_MOBILE has constant length 1 Constant Length
FLAG_PHONE has constant length 1 Constant Length
FLAG_EMAIL has constant length 1 Constant Length
REGION_RATING_CLIENT has constant length 1 Constant Length
REGION_RATING_CLIENT_W_CITY has constant length 1 Constant Length
REG_REGION_NOT_LIVE_REGION has constant length 1 Constant Length
REG_REGION_NOT_WORK_REGION has constant length 1 Constant Length
LIVE_REGION_NOT_WORK_REGION has constant length 1 Constant Length
REG_CITY_NOT_LIVE_CITY has constant length 1 Constant Length
REG_CITY_NOT_WORK_CITY has constant length 1 Constant Length
LIVE_CITY_NOT_WORK_CITY has constant length 1 Constant Length
FLAG_DOCUMENT_2 has constant length 1 Constant Length
FLAG_DOCUMENT_3 has constant length 1 Constant Length
FLAG_DOCUMENT_4 has constant length 1 Constant Length
FLAG_DOCUMENT_5 has constant length 1 Constant Length
FLAG_DOCUMENT_6 has constant length 1 Constant Length
FLAG_DOCUMENT_7 has constant length 1 Constant Length
FLAG_DOCUMENT_8 has constant length 1 Constant Length
FLAG_DOCUMENT_9 has constant length 1 Constant Length
FLAG_DOCUMENT_10 has constant length 1 Constant Length
FLAG_DOCUMENT_11 has constant length 1 Constant Length
FLAG_DOCUMENT_12 has constant length 1 Constant Length
FLAG_DOCUMENT_13 has constant length 1 Constant Length
FLAG_DOCUMENT_14 has constant length 1 Constant Length
FLAG_DOCUMENT_15 has constant length 1 Constant Length
FLAG_DOCUMENT_16 has constant length 1 Constant Length
FLAG_DOCUMENT_17 has constant length 1 Constant Length
FLAG_DOCUMENT_18 has constant length 1 Constant Length
FLAG_DOCUMENT_19 has constant length 1 Constant Length
FLAG_DOCUMENT_20 has constant length 1 Constant Length
FLAG_DOCUMENT_21 has constant length 1 Constant Length
AMT_REQ_CREDIT_BUREAU_HOUR has constant length 3 Constant Length
DAYS_BIRTH has 307511 (100.0%) negatives Negatives
DAYS_EMPLOYED has 252135 (81.99%) negatives Negatives
DAYS_REGISTRATION has 307431 (99.97%) negatives Negatives
DAYS_ID_PUBLISH has 307495 (99.99%) negatives Negatives
DAYS_LAST_PHONE_CHANGE has 269838 (87.75%) negatives Negatives
CNT_CHILDREN has 215371 (70.04%) zeros Zeros
ELEVATORS_AVG has 85718 (27.87%) zeros Zeros
LANDAREA_AVG has 15600 (5.07%) zeros Zeros
NONLIVINGAPARTMENTS_AVG has 54549 (17.74%) zeros Zeros
NONLIVINGAREA_AVG has 58735 (19.1%) zeros Zeros
BASEMENTAREA_MODE has 16598 (5.4%) zeros Zeros
ELEVATORS_MODE has 89498 (29.1%) zeros Zeros
LANDAREA_MODE has 17453 (5.68%) zeros Zeros
NONLIVINGAPARTMENTS_MODE has 59255 (19.27%) zeros Zeros
NONLIVINGAREA_MODE has 67126 (21.83%) zeros Zeros
ELEVATORS_MEDI has 87026 (28.3%) zeros Zeros
LANDAREA_MEDI has 15919 (5.18%) zeros Zeros
NONLIVINGAPARTMENTS_MEDI has 56097 (18.24%) zeros Zeros
NONLIVINGAREA_MEDI has 60954 (19.82%) zeros Zeros
OBS_30_CNT_SOCIAL_CIRCLE has 163910 (53.3%) zeros Zeros
DEF_30_CNT_SOCIAL_CIRCLE has 271324 (88.23%) zeros Zeros
OBS_60_CNT_SOCIAL_CIRCLE has 164666 (53.55%) zeros Zeros
DEF_60_CNT_SOCIAL_CIRCLE has 280721 (91.29%) zeros Zeros
DAYS_LAST_PHONE_CHANGE has 37672 (12.25%) zeros Zeros
AMT_REQ_CREDIT_BUREAU_DAY has 264503 (86.01%) zeros Zeros
AMT_REQ_CREDIT_BUREAU_WEEK has 257456 (83.72%) zeros Zeros
AMT_REQ_CREDIT_BUREAU_MON has 222233 (72.27%) zeros Zeros
AMT_REQ_CREDIT_BUREAU_QRT has 215417 (70.05%) zeros Zeros
AMT_REQ_CREDIT_BUREAU_YEAR has 71801 (23.35%) zeros Zeros
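
Many of the insights above come in threes: the _AVG, _MODE and _MEDI variants of a building attribute are flagged as similarly distributed and report identical missing counts (for example, all three APARTMENTS columns have 156061 missing values). The sketch below shows one way to detect such triples from column names alone. The helper names are hypothetical, and whether to keep only one column per triple is a modeling decision the report does not make.

```python
from collections import defaultdict

SUFFIXES = ("_AVG", "_MODE", "_MEDI")


def group_by_base(columns):
    """Map each base name to the set of suffixes seen for it."""
    groups = defaultdict(set)
    for col in columns:
        for suffix in SUFFIXES:
            if col.endswith(suffix):
                groups[col[: -len(suffix)]].add(suffix)
                break
    return groups


def complete_triples(columns):
    """Return the base names that occur with all three suffixes, sorted."""
    groups = group_by_base(columns)
    return sorted(base for base, seen in groups.items() if seen == set(SUFFIXES))


# A small sample of column names taken from the insights list.
sample = [
    "APARTMENTS_AVG", "APARTMENTS_MODE", "APARTMENTS_MEDI",
    "ELEVATORS_AVG", "ELEVATORS_MODE", "ELEVATORS_MEDI",
    "TOTALAREA_MODE", "HOUSETYPE_MODE", "OWN_CAR_AGE",
]
triples = complete_triples(sample)
print(triples)  # ['APARTMENTS', 'ELEVATORS']
```

Columns such as TOTALAREA_MODE and HOUSETYPE_MODE carry only one suffix, so they are left out of the result.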

Variables


SK_ID_CURR

numerical

Approximate Distinct Count 307511
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean 278180.5186
Minimum 100002
Maximum 456255
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • SK_ID_CURR is skewed left (γ1 = -0.0012)

Quantile Statistics

Minimum 100002
5-th Percentile 117229.48
Q1 188437.48
Median 277505.49
Q3 366424.74
95-th Percentile 437725.74
Maximum 456255
Range 356253
IQR 177987.26

Descriptive Statistics

Mean 278180.5186
Standard Deviation 102790.1753
Variance 1.0566e+10
Sum 8.5544e+10
Skewness -0.0012
Kurtosis -1.199
Coefficient of Variation 0.3695
  • SK_ID_CURR is not normally distributed (p-value 0.0013217584529078886)
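
Several of the figures in a numerical block are derived from the others. As a worked check on the SK_ID_CURR values above (variable names are illustrative; the report does not show its own formulas):

```python
# Figures copied from the SK_ID_CURR statistics.
minimum, maximum = 100002, 456255
q1, q3 = 188437.48, 366424.74
mean, std = 278180.5186, 102790.1753

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # interquartile range
variance = std ** 2               # variance is the squared standard deviation
cv = std / mean                   # coefficient of variation

print(f"range = {value_range}")
print(f"IQR = {iqr:.2f}")
print(f"variance = {variance:.4e}")
print(f"coefficient of variation = {cv:.4f}")
```

Each result agrees with the reported Range, IQR, Variance and Coefficient of Variation at the precision shown.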

TARGET

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 11.39 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • TARGET has words of constant length
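
The ratio between the two TARGET classes can be turned into a class share. The sketch below treats the reported "over 11.39" as an approximate ratio, so the resulting count is an estimate and not a figure from the report:

```python
# Ratio reported for TARGET: count(0) / count(1) is "over 11.39".
ratio = 11.39
n_rows = 307511

# If count(0) = ratio * count(1), then count(1) / total = 1 / (1 + ratio).
minority_share = 1 / (1 + ratio)
approx_minority_count = round(n_rows * minority_share)

print(f"share of TARGET == 1: {minority_share:.2%}")
print(f"approximate count: {approx_minority_count}")
```

A share of about 8% means the target is strongly imbalanced.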

NAME_CONTRACT_TYPE

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 23209720
  • The largest value (Cash loans) is over 9.5 times larger than the second largest value (Revolving loans)

Length

Mean 10.4761
Standard Deviation 1.4675
Median 10
Minimum 10
Maximum 15

Sample

1st row Cash loans
2nd row Cash loans
3rd row Revolving loans
4th row Cash loans
5th row Cash loans

Letter

Count 2913994
Lowercase Letter 2606483
Space Separator 307511
Uppercase Letter 307511
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Cash loans, Revolving loans) take over 50.0%

CODE_GENDER

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295734
  • The largest value (F) is over 1.93 times larger than the second largest value (M)

Length

Mean 1
Standard Deviation 0.007213
Median 1
Minimum 1
Maximum 3

Sample

1st row M
2nd row F
3rd row M
4th row F
5th row M

Letter

Count 307519
Lowercase Letter 0
Space Separator 0
Uppercase Letter 307519
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (F, M) take over 50.0%
  • The largest value (f) is over 1.93 times larger than the second largest value (m)

FLAG_OWN_CAR

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (N) is over 1.94 times larger than the second largest value (Y)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row N
2nd row N
3rd row Y
4th row N
5th row N

Letter

Count 307511
Lowercase Letter 0
Space Separator 0
Uppercase Letter 307511
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (N, Y) take over 50.0%
  • The largest value (n) is over 1.94 times larger than the second largest value (y)
  • FLAG_OWN_CAR has words of constant length

FLAG_OWN_REALTY

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (Y) is over 2.26 times larger than the second largest value (N)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row Y
2nd row N
3rd row Y
4th row Y
5th row Y

Letter

Count 307511
Lowercase Letter 0
Space Separator 0
Uppercase Letter 307511
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Y, N) take over 50.0%
  • The largest value (y) is over 2.26 times larger than the second largest value (n)
  • FLAG_OWN_REALTY has words of constant length

CNT_CHILDREN

numerical

Approximate Distinct Count 15
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean 0.4171
Minimum 0
Maximum 19
Zeros 215371
Zeros (%) 70.0%
Negatives 0
Negatives (%) 0.0%
  • CNT_CHILDREN is skewed right (γ1 = 1.9746)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 1
95-th Percentile 2
Maximum 19
Range 19
IQR 1

Descriptive Statistics

Mean 0.4171
Standard Deviation 0.7221
Variance 0.5215
Sum 128248
Skewness 1.9746
Kurtosis 7.904
Coefficient of Variation 1.7315
  • CNT_CHILDREN is not normally distributed (p-value 3.271695354937853e-23)
  • CNT_CHILDREN has 4272 outliers

AMT_INCOME_TOTAL

numerical

Approximate Distinct Count 2548
Approximate Unique (%) 0.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean 168797.9193
Minimum 25650
Maximum 1.17e+08
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AMT_INCOME_TOTAL is skewed right (γ1 = 391.5577)

Quantile Statistics

Minimum 25650
5-th Percentile 67500
Q1 112500
Median 148500
Q3 202500
95-th Percentile 337500
Maximum 1.17e+08
Range 1.1697e+08
IQR 90000

Descriptive Statistics

Mean 168797.9193
Standard Deviation 237123.1463
Variance 5.6227e+10
Sum 5.1907e+10
Skewness 391.5577
Kurtosis 191783.436
Coefficient of Variation 1.4048
  • AMT_INCOME_TOTAL is not normally distributed (p-value 4.226514629279953e-25)
  • AMT_INCOME_TOTAL has 14035 outliers
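
The report does not state which rule it uses to count outliers. A common choice is Tukey's fences, which place the cut-offs 1.5 IQR beyond the quartiles. The sketch below applies that rule to the AMT_INCOME_TOTAL quartiles as an assumption, not as the report's documented method:

```python
# Quartiles copied from the AMT_INCOME_TOTAL quantile statistics.
q1, q3 = 112500, 202500

iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(f"IQR = {iqr}")
print(f"lower fence = {lower_fence}")
print(f"upper fence = {upper_fence}")
```

Under this rule the lower fence is negative, so only high incomes can be flagged, and the upper fence coincides with the reported 95th percentile of 337500.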

AMT_CREDIT

numerical

Approximate Distinct Count 5603
Approximate Unique (%) 1.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean 599025.9997
Minimum 45000
Maximum 4.05e+06
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AMT_CREDIT is skewed right (γ1 = 1.2348)

Quantile Statistics

Minimum 45000
5-th Percentile 137520
Q1 270000
Median 518562
Q3 808650
95-th Percentile 1.35e+06
Maximum 4.05e+06
Range 4.005e+06
IQR 538650

Descriptive Statistics

Mean 599025.9997
Standard Deviation 402490.777
Variance 1.62e+11
Sum 1.8421e+11
Skewness 1.2348
Kurtosis 1.934
Coefficient of Variation 0.6719
  • AMT_CREDIT is not normally distributed (p-value 1.0122933513207945e-07)
  • AMT_CREDIT has 6562 outliers

AMT_ANNUITY

numerical

Approximate Distinct Count 13672
Approximate Unique (%) 4.4%
Missing 12
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4919984
Mean 27108.5739
Minimum 1615.5
Maximum 258025.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AMT_ANNUITY is skewed right (γ1 = 1.5798)

Quantile Statistics

Minimum 1615.5
5-th Percentile 9000
Q1 16573.5
Median 25002
Q3 34749
95-th Percentile 53460
Maximum 258025.5
Range 256410
IQR 18175.5

Descriptive Statistics

Mean 27108.5739
Standard Deviation 14493.7373
Variance 2.1007e+08
Sum 8.3359e+09
Skewness 1.5798
Kurtosis 7.7072
Coefficient of Variation 0.5347
  • AMT_ANNUITY is not normally distributed (p-value 2.5876594545761245e-08)
  • AMT_ANNUITY has 7134 outliers

AMT_GOODS_PRICE

numerical

Approximate Distinct Count 1002
Approximate Unique (%) 0.3%
Missing 278
Missing (%) 0.1%
Infinite 0
Infinite (%) 0.0%
Memory Size 4915728
Mean 538396.2074
Minimum 40500
Maximum 4.05e+06
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AMT_GOODS_PRICE is skewed right (γ1 = 1.349)

Quantile Statistics

Minimum 40500
5-th Percentile 135000
Q1 238500
Median 450000
Q3 679500
95-th Percentile 1.323e+06
Maximum 4.05e+06
Range 4.0095e+06
IQR 441000

Descriptive Statistics

Mean 538396.2074
Standard Deviation 369446.4605
Variance 1.3649e+11
Sum 1.6541e+11
Skewness 1.349
Kurtosis 2.4319
Coefficient of Variation 0.6862
  • AMT_GOODS_PRICE is not normally distributed (p-value 3.635706003523381e-11)
  • AMT_GOODS_PRICE has 14728 outliers

NAME_TYPE_SUITE

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 1292
Missing (%) 0.4%
Memory Size 23595170
  • The largest value (Unaccompanied) is over 6.19 times larger than the second largest value (Family)

Length

Mean 12.0533
Standard Deviation 2.5014
Median 13
Minimum 6
Maximum 15

Sample

1st row Unaccompanied
2nd row Family
3rd row Unaccompanied
4th row Unaccompanied
5th row Unaccompanied

Letter

Count 3665017
Lowercase Letter 3356162
Space Separator 11912
Uppercase Letter 308855
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Unaccompanied, Family) take over 50.0%
  • The largest value (unaccompanied) is over 6.19 times larger than the second largest value (family)

NAME_INCOME_TYPE

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 23312901
  • The largest value (Working) is over 2.22 times larger than the second largest value (Commercial associate)

Length

Mean 10.8116
Standard Deviation 5.3003
Median 7
Minimum 7
Maximum 20

Sample

1st row Working
2nd row State servant
3rd row Working
4th row Working
5th row Working

Letter

Count 3231361
Lowercase Letter 2923850
Space Separator 93325
Uppercase Letter 307511
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Working, Commercial associate) take over 50.0%
  • The largest value (working) is over 2.22 times larger than the second largest value (associate)

NAME_EDUCATION_TYPE

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 27753771
  • The largest value (Secondary / secondary special) is over 2.92 times larger than the second largest value (Higher education)

Length

Mean 25.2529
Standard Deviation 5.8695
Median 29
Minimum 15
Maximum 29

Sample

1st row Secondary / second...
2nd row Higher education
3rd row Secondary / second...
4th row Secondary / second...
5th row Secondary / second...

Letter

Count 6802872
Lowercase Letter 6495361
Space Separator 744293
Uppercase Letter 307511
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Secondary / secondary special, Higher education) take over 50.0%
  • The largest value (secondary) is over 2.02 times larger than the second largest value (special)

NAME_FAMILY_STATUS

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 22947353
  • The largest value (Married) is over 4.32 times larger than the second largest value (Single / not married)

Length

Mean 9.6229
Standard Deviation 4.8277
Median 7
Minimum 5
Maximum 20

Sample

1st row Single / not marri...
2nd row Married
3rd row Single / not marri...
4th row Civil marriage
5th row Single / not marri...

Letter

Count 2747587
Lowercase Letter 2440076
Space Separator 166107
Uppercase Letter 307511
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Married, Single / not married) take over 50.0%
  • The largest value (married) is over 5.32 times larger than the second largest value (single)

NAME_HOUSING_TYPE

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 25154326
  • The largest value (House / apartment) is over 18.39 times larger than the second largest value (With parents)

Length

Mean 16.7998
Standard Deviation 1.1622
Median 17
Minimum 12
Maximum 19

Sample

1st row House / apartment
2nd row House / apartment
3rd row House / apartment
4th row House / apartment
5th row House / apartment

Letter

Count 4311742
Lowercase Letter 4004231
Space Separator 580379
Uppercase Letter 307511
Dash Punctuation 1122
Decimal Number 0
  • The top 2 categories (House / apartment, With parents) take over 50.0%

REGION_POPULATION_RELATIVE

numerical

Approximate Distinct Count 81
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean 0.02087
Minimum 0.00029
Maximum 0.07251
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • REGION_POPULATION_RELATIVE is skewed right (γ1 = 1.488)

Quantile Statistics

Minimum 0.00029
5-th Percentile 0.00496
Q1 0.01001
Median 0.01885
Q3 0.02866
95-th Percentile 0.04622
Maximum 0.07251
Range 0.07222
IQR 0.01866

Descriptive Statistics

Mean 0.02087
Standard Deviation 0.01383
Variance 0.0001913
Sum 6417.174
Skewness 1.488
Kurtosis 3.26
Coefficient of Variation 0.6628
  • REGION_POPULATION_RELATIVE is not normally distributed (p-value 0.00018612990959317665)
  • REGION_POPULATION_RELATIVE has 8412 outliers

DAYS_BIRTH

numerical

Approximate Distinct Count 17460
Approximate Unique (%) 5.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean -16036.9951
Minimum -25229
Maximum -7489
Zeros 0
Zeros (%) 0.0%
Negatives 307511
Negatives (%) 100.0%
  • DAYS_BIRTH is skewed left (γ1 = -0.1157)

Quantile Statistics

Minimum -25229
5-th Percentile -23180.95
Q1 -19652
Median -15733
Q3 -12388
95-th Percentile -9362
Maximum -7489
Range 17740
IQR 7264

Descriptive Statistics

Mean -16036.9951
Standard Deviation 4363.9886
Variance 1.9044e+07
Sum -4.9316e+09
Skewness -0.1157
Kurtosis -1.0491
Coefficient of Variation -0.2721
  • DAYS_BIRTH is not normally distributed (p-value 0.005772698802540314)
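
DAYS_BIRTH is negative for every row, which suggests it counts days backward from a reference date. Reading it as age at that reference date is an assumption; the report does not define the column. Under that reading, a conversion to years could look like this (the function name and the 365.25-day year are illustrative choices):

```python
DAYS_PER_YEAR = 365.25  # average year length, an approximation


def days_to_years(days_before_reference):
    """Convert a non-positive day offset into a positive number of years."""
    return -days_before_reference / DAYS_PER_YEAR


# Values copied from the DAYS_BIRTH statistics.
reported = {"Minimum": -25229, "Mean": -16036.9951, "Maximum": -7489}
ages = {label: days_to_years(days) for label, days in reported.items()}

for label, years in ages.items():
    print(f"{label}: {years:.1f} years")
```

Because the sign flips, the column's minimum corresponds to the oldest applicant and its maximum to the youngest.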

DAYS_EMPLOYED

numerical

Approximate Distinct Count 12574
Approximate Unique (%) 4.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean 63815.0459
Minimum -17912
Maximum 365243
Zeros 2
Zeros (%) 0.0%
Negatives 252135
Negatives (%) 82.0%
  • DAYS_EMPLOYED is skewed right (γ1 = 1.6643)

Quantile Statistics

Minimum -17912
5-th Percentile -6654.95
Q1 -2717
Median -1209
Q3 -286.25
95-th Percentile 365243
Maximum 365243
Range 383155
IQR 2430.75

Descriptive Statistics

Mean 63815.0459
Standard Deviation 141275.7665
Variance 1.9959e+10
Sum 1.9624e+10
Skewness 1.6643
Kurtosis 0.7716
Coefficient of Variation 2.2138
  • DAYS_EMPLOYED is not normally distributed (p-value 5.419963280899942e-20)
  • DAYS_EMPLOYED has 72751 outliers
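
The maximum and the 95th percentile of DAYS_EMPLOYED are both 365243, about 1,000 years, which looks like a placeholder instead of a real duration. The sketch below assumes that every positive value equals that placeholder (an assumption the report does not confirm) and backs out the mean of the negative values from the reported totals:

```python
# Figures copied from the DAYS_EMPLOYED statistics.
n_rows = 307511
n_negative = 252135
n_zero = 2
mean_all = 63815.0459
placeholder = 365243  # equals both the maximum and the 95th percentile

n_positive = n_rows - n_negative - n_zero

# Assumption: every positive value is the placeholder.
total = mean_all * n_rows
implied_negative_sum = total - n_positive * placeholder
implied_negative_mean = implied_negative_sum / n_negative

print(f"positive rows: {n_positive}")
print(f"placeholder in years: {placeholder / 365.25:.0f}")
print(f"implied mean of negative values: {implied_negative_mean:.0f} days")
```

If the assumption holds, the positive mean of 63815 is an artifact of the placeholder, and the negative values average roughly 6.5 years before the reference date.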

DAYS_REGISTRATION

numerical

Approximate Distinct Count 15688
Approximate Unique (%) 5.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean -4986.1203
Minimum -24672
Maximum 0
Zeros 80
Zeros (%) 0.0%
Negatives 307431
Negatives (%) 100.0%
  • DAYS_REGISTRATION is skewed left (γ1 = -0.5909)

Quantile Statistics

Minimum -24672
5-th Percentile -11397
Q1 -7457
Median -4479
Q3 -1989
95-th Percentile -319.05
Maximum 0
Range 24672
IQR 5468

Descriptive Statistics

Mean -4986.1203
Standard Deviation 3522.8863
Variance 1.2411e+07
Sum -1.5333e+09
Skewness -0.5909
Kurtosis -0.3214
Coefficient of Variation -0.7065
  • DAYS_REGISTRATION has 674 outliers

DAYS_ID_PUBLISH

numerical

Approximate Distinct Count 6168
Approximate Unique (%) 2.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean -2994.2024
Minimum -7197
Maximum 0
Zeros 16
Zeros (%) 0.0%
Negatives 307495
Negatives (%) 100.0%
  • DAYS_ID_PUBLISH is skewed right (γ1 = 0.3493)

Quantile Statistics

Minimum -7197
5-th Percentile -4942
Q1 -4296
Median -3237
Q3 -1709
95-th Percentile -369.05
Maximum 0
Range 7197
IQR 2587

Descriptive Statistics

Mean -2994.2024
Standard Deviation 1509.4504
Variance 2.2784e+06
Sum -9.2075e+08
Skewness 0.3493
Kurtosis -1.1068
Coefficient of Variation -0.5041

OWN_CAR_AGE

numerical

Approximate Distinct Count 62
Approximate Unique (%) 0.1%
Missing 202929
Missing (%) 66.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1673312
Mean 12.0611
Minimum 0
Maximum 91
Zeros 2134
Zeros (%) 0.7%
Negatives 0
Negatives (%) 0.0%
  • OWN_CAR_AGE is skewed right (γ1 = 2.7454)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 5
Median 9
Q3 15
95-th Percentile 30
Maximum 91
Range 91
IQR 10

Descriptive Statistics

Mean 12.0611
Standard Deviation 11.9448
Variance 142.6785
Sum 1.2614e+06
Skewness 2.7454
Kurtosis 9.2144
Coefficient of Variation 0.9904
  • OWN_CAR_AGE is not normally distributed (p-value 6.532370095768674e-06)
  • OWN_CAR_AGE has 4932 outliers
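
The missing rate of OWN_CAR_AGE can be compared with the FLAG_OWN_CAR ratio reported earlier (N is "over 1.94 times" Y). The sketch below treats that ratio as approximate. A close match would suggest that car age is missing mainly for applicants without a car, but that is an inference, not something the report states:

```python
# Ratio reported for FLAG_OWN_CAR: count(N) / count(Y) is "over 1.94".
ratio_n_to_y = 1.94

# Figures copied from the OWN_CAR_AGE statistics.
n_rows = 307511
n_missing_car_age = 202929

share_without_car = ratio_n_to_y / (1 + ratio_n_to_y)
share_missing_car_age = n_missing_car_age / n_rows

print(f"share with FLAG_OWN_CAR == N: {share_without_car:.2%}")
print(f"share with OWN_CAR_AGE missing: {share_missing_car_age:.2%}")
```

The two shares agree to within a small fraction of a percentage point.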

FLAG_MOBIL

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (1) is over 307510.0 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (1, 0) take over 50.0%
  • FLAG_MOBIL has words of constant length

FLAG_EMP_PHONE

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (1) is over 4.55 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (1, 0) take over 50.0%
  • FLAG_EMP_PHONE has words of constant length

FLAG_WORK_PHONE

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 4.02 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 1
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_WORK_PHONE has words of constant length

FLAG_CONT_MOBILE

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (1) is over 534.73 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (1, 0) take over 50.0%
  • FLAG_CONT_MOBILE has words of constant length

FLAG_PHONE

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 2.56 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_PHONE has words of constant length

FLAG_EMAIL

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (0) occurs over 16.63 times as often as the second most frequent value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_EMAIL has words of constant length

OCCUPATION_TYPE

categorical

Approximate Distinct Count 18
Approximate Unique (%) 0.0%
Missing 96391
Missing (%) 31.4%
Memory Size 15950430
  • The most frequent value (Laborers) occurs over 1.72 times as often as the second most frequent value (Sales staff)

Length

Mean 10.5515
Standard Deviation 3.6434
Median 10
Minimum 7
Maximum 21

Sample

1st row Laborers
2nd row Core staff
3rd row Laborers
4th row Laborers
5th row Core staff

Letter

Count 2093935
Lowercase Letter 1879633
Space Separator 130254
Uppercase Letter 214302
Dash Punctuation 2093
Decimal Number 0
  • The most frequent word (staff) occurs over 1.78 times as often as the second most frequent word (laborers)
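
The character-class rows for this variable use the names of Unicode general categories. A minimal sketch of how such counts could be produced with the standard library, assuming the report classifies each character by its general category; the function name and the sample strings are illustrative, and the third sample string is hypothetical:

```python
import unicodedata
from collections import Counter

# Unicode general-category codes mapped to the labels used in the report.
LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}


def char_class_counts(values):
    """Count characters per Unicode category across all strings in values."""
    counts = Counter({label: 0 for label in LABELS.values()})
    for value in values:
        for ch in value:
            label = LABELS.get(unicodedata.category(ch))
            if label is not None:
                counts[label] += 1
    # In the report, Count equals lowercase + uppercase letters
    # (1879633 + 214302 = 2093935 for OCCUPATION_TYPE).
    counts["Count"] = counts["Lowercase Letter"] + counts["Uppercase Letter"]
    return dict(counts)


sample = ["Laborers", "Core staff", "Night-shift staff"]
print(char_class_counts(sample))
```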

CNT_FAM_MEMBERS

numerical

Approximate Distinct Count 17
Approximate Unique (%) 0.0%
Missing 2
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920144
Mean 2.1527
Minimum 1
Maximum 20
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • CNT_FAM_MEMBERS is skewed right (γ1 = 0.9875)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 2
Median 2
Q3 3
95-th Percentile 4
Maximum 20
Range 19
IQR 1

Descriptive Statistics

Mean 2.1527
Standard Deviation 0.9107
Variance 0.8293
Sum 661964
Skewness 0.9875
Kurtosis 2.8019
Coefficient of Variation 0.423
  • CNT_FAM_MEMBERS is not normally distributed (p-value 5.732244714995868e-20)
  • CNT_FAM_MEMBERS has 4007 outliers
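
The derived rows for CNT_FAM_MEMBERS (Range, IQR, Coefficient of Variation) follow directly from the other reported values. The sketch below re-derives them and also computes outlier fences under Tukey's 1.5 × IQR rule, which is an assumption here: the report does not state which outlier rule it applies.

```python
# Summary values copied from the CNT_FAM_MEMBERS block.
minimum, q1, q3, maximum = 1, 2, 3, 20
mean, std = 2.1527, 0.9107

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # IQR
cv = std / mean                   # Coefficient of Variation

# Assumption: Tukey's 1.5 * IQR fences (not confirmed by the report).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, round(cv, 3), lower_fence, upper_fence)
# prints: 19 1 0.423 0.5 4.5
```

Under that assumed rule, family sizes of 5 or more would fall above the upper fence.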

REGION_RATING_CLIENT

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (2) occurs over 4.7 times as often as the second most frequent value (3)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 2
2nd row 1
3rd row 2
4th row 2
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (2, 3) take over 50.0%
  • REGION_RATING_CLIENT has words of constant length

REGION_RATING_CLIENT_W_CITY

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (2) occurs over 5.23 times as often as the second most frequent value (3)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 2
2nd row 1
3rd row 2
4th row 2
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (2, 3) take over 50.0%
  • REGION_RATING_CLIENT_W_CITY has words of constant length

WEEKDAY_APPR_PROCESS_START

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 22211870

Length

Mean 7.2311
Standard Deviation 1.1305
Median 7
Minimum 6
Maximum 9

Sample

1st row WEDNESDAY
2nd row MONDAY
3rd row MONDAY
4th row WEDNESDAY
5th row THURSDAY

Letter

Count 2223655
Lowercase Letter 0
Space Separator 0
Uppercase Letter 2223655
Dash Punctuation 0
Decimal Number 0

HOUR_APPR_PROCESS_START

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920176
Mean 12.0634
Minimum 0
Maximum 23
Zeros 40
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • HOUR_APPR_PROCESS_START is nearly symmetric, with a negligible left skew (γ1 = -0.028)

Quantile Statistics

Minimum 0
5-th Percentile 7
Q1 10
Median 12
Q3 14
95-th Percentile 17
Maximum 23
Range 23
IQR 4

Descriptive Statistics

Mean 12.0634
Standard Deviation 3.2658
Variance 10.6657
Sum 3.7096e+06
Skewness -0.02802
Kurtosis -0.1942
Coefficient of Variation 0.2707
  • HOUR_APPR_PROCESS_START is not normally distributed (p-value 1.5346588055171966e-05)
  • HOUR_APPR_PROCESS_START has 2257 outliers
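
The Skewness (γ1) and Kurtosis rows can be read as standardized third and fourth moments. Because negative Kurtosis values appear in this report, the sketch below assumes excess kurtosis (normal distribution = 0) and population moments; whether the report applies a bias correction is not stated, and the function name is illustrative.

```python
def skewness_and_excess_kurtosis(xs):
    """Return (skewness, excess kurtosis) using population central moments."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    m4 = sum((x - mean) ** 4 for x in xs) / n
    skew = m3 / m2 ** 1.5            # gamma_1: sign gives the skew direction
    excess_kurt = m4 / m2 ** 2 - 3.0  # 0 for a normal distribution
    return skew, excess_kurt


# Symmetric toy data: skewness is 0, tails are lighter than normal.
print(skewness_and_excess_kurtosis([1, 2, 3, 4, 5]))
# A long right tail yields positive skewness.
print(skewness_and_excess_kurtosis([1, 1, 1, 2, 10]))
```

A skewness as close to zero as the one reported for HOUR_APPR_PROCESS_START indicates an almost symmetric distribution, regardless of its sign.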

REG_REGION_NOT_LIVE_REGION

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (0) occurs over 65.03 times as often as the second most frequent value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • REG_REGION_NOT_LIVE_REGION has words of constant length

REG_REGION_NOT_WORK_REGION

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (0) occurs over 18.7 times as often as the second most frequent value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • REG_REGION_NOT_WORK_REGION has words of constant length

LIVE_REGION_NOT_WORK_REGION

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (0) occurs over 23.59 times as often as the second most frequent value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • LIVE_REGION_NOT_WORK_REGION has words of constant length

REG_CITY_NOT_LIVE_CITY

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (0) occurs over 11.79 times as often as the second most frequent value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • REG_CITY_NOT_LIVE_CITY has words of constant length

REG_CITY_NOT_WORK_CITY

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (0) occurs over 3.34 times as often as the second most frequent value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • REG_CITY_NOT_WORK_CITY has words of constant length

LIVE_CITY_NOT_WORK_CITY

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The most frequent value (0) occurs over 4.57 times as often as the second most frequent value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • LIVE_CITY_NOT_WORK_CITY has words of constant length

ORGANIZATION_TYPE

categorical

Approximate Distinct Count 58
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 23840108

Length

Mean 12.526
Standard Deviation 7.0963
Median 13
Minimum 3
Maximum 22

Sample

1st row Business Entity Ty...
2nd row School
3rd row Government
4th row Business Entity Ty...
5th row Religion

Letter

Count 3319373
Lowercase Letter 2729777
Space Separator 331098
Uppercase Letter 589596
Dash Punctuation 38412
Decimal Number 125394

EXT_SOURCE_1

numerical

Approximate Distinct Count 114584
Approximate Unique (%) 85.4%
Missing 173378
Missing (%) 56.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 2146128
Mean 0.5021
Minimum 0.01457
Maximum 0.9627
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • EXT_SOURCE_1 is nearly symmetric, with a slight left skew (γ1 = -0.0688)

Quantile Statistics

Minimum 0.01457
5-th Percentile 0.1595
Q1 0.3355
Median 0.5081
Q3 0.6765
95-th Percentile 0.8352
Maximum 0.9627
Range 0.9481
IQR 0.341

Descriptive Statistics

Mean 0.5021
Standard Deviation 0.2111
Variance 0.04455
Sum 67352.1772
Skewness -0.06875
Kurtosis -0.9652
Coefficient of Variation 0.4203
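
The two percentage rows for EXT_SOURCE_1 appear to use different denominators. The check below reproduces both figures from the reported counts; dividing the distinct count by the non-missing rows is an inference from the numbers, not something the report documents.

```python
n_rows = 307511    # Number of Rows (dataset overview)
missing = 173378   # EXT_SOURCE_1: Missing
distinct = 114584  # EXT_SOURCE_1: Approximate Distinct Count

# Missing (%) is taken over all rows.
missing_pct = 100 * missing / n_rows
# Assumption inferred from the figures: Unique (%) is taken over non-missing rows.
unique_pct = 100 * distinct / (n_rows - missing)

print(f"{missing_pct:.1f}% missing, {unique_pct:.1f}% unique")
# prints: 56.4% missing, 85.4% unique
```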

EXT_SOURCE_2

numerical

Approximate Distinct Count 119831
Approximate Unique (%) 39.1%
Missing 660
Missing (%) 0.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 4909616
Mean 0.5144
Minimum 8.1736e-08
Maximum 0.855
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • EXT_SOURCE_2 is skewed left (γ1 = -0.7936)

Quantile Statistics

Minimum 8.1736e-08
5-th Percentile 0.1358
Q1 0.3965
Median 0.5666
Q3 0.6644
95-th Percentile 0.7485
Maximum 0.855
Range 0.855
IQR 0.2679

Descriptive Statistics

Mean 0.5144
Standard Deviation 0.1911
Variance 0.0365
Sum 157841.9064
Skewness -0.7936
Kurtosis -0.2691
Coefficient of Variation 0.3714

EXT_SOURCE_3

numerical

Approximate Distinct Count 814
Approximate Unique (%) 0.3%
Missing 60965
Missing (%) 19.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 3944736
Mean 0.5109
Minimum 0.00052727
Maximum 0.896
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • EXT_SOURCE_3 is skewed left (γ1 = -0.4094)

Quantile Statistics

Minimum 0.00052727
5-th Percentile 0.1566
Q1 0.3723
Median 0.5371
Q3 0.6691
95-th Percentile 0.7887
Maximum 0.896
Range 0.8955
IQR 0.2967

Descriptive Statistics

Mean 0.5109
Standard Deviation 0.1948
Variance 0.03796
Sum 125948.7406
Skewness -0.4094
Kurtosis -0.6635
Coefficient of Variation 0.3814
  • EXT_SOURCE_3 is not normally distributed (p-value 0.0022493586921736365)

APARTMENTS_AVG

numerical

Approximate Distinct Count 2339
Approximate Unique (%) 1.5%
Missing 156061
Missing (%) 50.7%
Infinite 0
Infinite (%) 0.0%
Memory Size 2423200
Mean 0.1174
Minimum 0
Maximum 1
Zeros 751
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • APARTMENTS_AVG is skewed right (γ1 = 2.6418)

Quantile Statistics

Minimum 0
5-th Percentile 0.0093
Q1 0.0577
Median 0.0876
Q3 0.1485
95-th Percentile 0.3299
Maximum 1
Range 1
IQR 0.0908

Descriptive Statistics

Mean 0.1174
Standard Deviation 0.1082
Variance 0.01172
Sum 17786.3636
Skewness 2.6418
Kurtosis 11.3934
Coefficient of Variation 0.9217
  • APARTMENTS_AVG is not normally distributed (p-value 4.3796885194118934e-09)
  • APARTMENTS_AVG has 10655 outliers

BASEMENTAREA_AVG

numerical

Approximate Distinct Count 3780
Approximate Unique (%) 3.0%
Missing 179943
Missing (%) 58.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 2041088
Mean 0.08844
Minimum 0
Maximum 1
Zeros 14745
Zeros (%) 4.8%
Negatives 0
Negatives (%) 0.0%
  • BASEMENTAREA_AVG is skewed right (γ1 = 3.5663)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0446
Median 0.0767
Q3 0.1127
95-th Percentile 0.2263
Maximum 1
Range 1
IQR 0.0681

Descriptive Statistics

Mean 0.08844
Standard Deviation 0.08244
Variance 0.006796
Sum 11282.397
Skewness 3.5663
Kurtosis 25.9291
Coefficient of Variation 0.9321
  • BASEMENTAREA_AVG is not normally distributed (p-value 2.967289412862341e-09)
  • BASEMENTAREA_AVG has 7140 outliers

YEARS_BEGINEXPLUATATION_AVG

numerical

Approximate Distinct Count 285
Approximate Unique (%) 0.2%
Missing 150007
Missing (%) 48.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 2520064
Mean 0.9777
Minimum 0
Maximum 1
Zeros 514
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • YEARS_BEGINEXPLUATATION_AVG is skewed left (γ1 = -15.5151)

Quantile Statistics

Minimum 0
5-th Percentile 0.9687
Q1 0.9767
Median 0.9821
Q3 0.9866
95-th Percentile 0.996
Maximum 1
Range 1
IQR 0.0099

Descriptive Statistics

Mean 0.9777
Standard Deviation 0.05922
Variance 0.003507
Sum 153997.1511
Skewness -15.5151
Kurtosis 248.1684
Coefficient of Variation 0.06057
  • YEARS_BEGINEXPLUATATION_AVG is not normally distributed (p-value 3.548100482225903e-20)
  • YEARS_BEGINEXPLUATATION_AVG has 4784 outliers

YEARS_BUILD_AVG

numerical

Approximate Distinct Count 149
Approximate Unique (%) 0.1%
Missing 204488
Missing (%) 66.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 1648368
Mean 0.7525
Minimum 0
Maximum 1
Zeros 102
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • YEARS_BUILD_AVG is skewed left (γ1 = -0.9625)

Quantile Statistics

Minimum 0
5-th Percentile 0.5988
Q1 0.6872
Median 0.7552
Q3 0.8232
95-th Percentile 0.9524
Maximum 1
Range 1
IQR 0.136

Descriptive Statistics

Mean 0.7525
Standard Deviation 0.1133
Variance 0.01283
Sum 77521.8644
Skewness -0.9625
Kurtosis 4.3995
Coefficient of Variation 0.1505
  • YEARS_BUILD_AVG is not normally distributed (p-value 0.0035686708844317227)
  • YEARS_BUILD_AVG has 2154 outliers

COMMONAREA_AVG

numerical

Approximate Distinct Count 3181
Approximate Unique (%) 3.4%
Missing 214865
Missing (%) 69.9%
Infinite 0
Infinite (%) 0.0%
Memory Size 1482336
Mean 0.04462
Minimum 0
Maximum 1
Zeros 8442
Zeros (%) 2.7%
Negatives 0
Negatives (%) 0.0%
  • COMMONAREA_AVG is skewed right (γ1 = 5.4572)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0079
Median 0.0214
Q3 0.0518
95-th Percentile 0.1618
Maximum 1
Range 1
IQR 0.0439

Descriptive Statistics

Mean 0.04462
Standard Deviation 0.07604
Variance 0.005781
Sum 4133.9308
Skewness 5.4572
Kurtosis 45.9855
Coefficient of Variation 1.704
  • COMMONAREA_AVG is not normally distributed (p-value 3.926208529964042e-21)
  • COMMONAREA_AVG has 7883 outliers

ELEVATORS_AVG

numerical

Approximate Distinct Count 257
Approximate Unique (%) 0.2%
Missing 163891
Missing (%) 53.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 2297920
Mean 0.07894
Minimum 0
Maximum 1
Zeros 85718
Zeros (%) 27.9%
Negatives 0
Negatives (%) 0.0%
  • ELEVATORS_AVG is skewed right (γ1 = 2.4394)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.12
95-th Percentile 0.36
Maximum 1
Range 1
IQR 0.12

Descriptive Statistics

Mean 0.07894
Standard Deviation 0.1346
Variance 0.01811
Sum 11337.58
Skewness 2.4394
Kurtosis 7.8691
Coefficient of Variation 1.7048
  • ELEVATORS_AVG is not normally distributed (p-value 2.8459148867640713e-24)
  • ELEVATORS_AVG has 10420 outliers

ENTRANCES_AVG

numerical

Approximate Distinct Count 285
Approximate Unique (%) 0.2%
Missing 154828
Missing (%) 50.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 2442928
Mean 0.1497
Minimum 0
Maximum 1
Zeros 323
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • ENTRANCES_AVG is skewed right (γ1 = 2.3997)

Quantile Statistics

Minimum 0
5-th Percentile 0.0345
Q1 0.069
Median 0.1379
Q3 0.2069
95-th Percentile 0.3103
Maximum 1
Range 1
IQR 0.1379

Descriptive Statistics

Mean 0.1497
Standard Deviation 0.1
Variance 0.01001
Sum 22860.4118
Skewness 2.3997
Kurtosis 11.5928
Coefficient of Variation 0.6682
  • ENTRANCES_AVG is not normally distributed (p-value 4.133926501741984e-12)
  • ENTRANCES_AVG has 3882 outliers

FLOORSMAX_AVG

numerical

Approximate Distinct Count 403
Approximate Unique (%) 0.3%
Missing 153020
Missing (%) 49.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 2471856
Mean 0.2263
Minimum 0
Maximum 1
Zeros 2938
Zeros (%) 1.0%
Negatives 0
Negatives (%) 0.0%
  • FLOORSMAX_AVG is skewed right (γ1 = 1.2264)

Quantile Statistics

Minimum 0
5-th Percentile 0.0417
Q1 0.1667
Median 0.1667
Q3 0.3333
95-th Percentile 0.5
Maximum 1
Range 1
IQR 0.1666

Descriptive Statistics

Mean 0.2263
Standard Deviation 0.1446
Variance 0.02092
Sum 34958.5181
Skewness 1.2264
Kurtosis 2.4324
Coefficient of Variation 0.6392
  • FLOORSMAX_AVG is not normally distributed (p-value 1.0921159448364393e-19)
  • FLOORSMAX_AVG has 5215 outliers

FLOORSMIN_AVG

numerical

Approximate Distinct Count 305
Approximate Unique (%) 0.3%
Missing 208642
Missing (%) 67.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 1581904
Mean 0.2319
Minimum 0
Maximum 1
Zeros 2320
Zeros (%) 0.8%
Negatives 0
Negatives (%) 0.0%
  • FLOORSMIN_AVG is skewed right (γ1 = 0.9542)

Quantile Statistics

Minimum 0
5-th Percentile 0.0417
Q1 0.0833
Median 0.2083
Q3 0.375
95-th Percentile 0.5
Maximum 1
Range 1
IQR 0.2917

Descriptive Statistics

Mean 0.2319
Standard Deviation 0.1614
Variance 0.02604
Sum 22927.0785
Skewness 0.9542
Kurtosis 1.3381
Coefficient of Variation 0.6959
  • FLOORSMIN_AVG is not normally distributed (p-value 5.339819877297509e-17)
  • FLOORSMIN_AVG has 343 outliers

LANDAREA_AVG

numerical

Approximate Distinct Count 3527
Approximate Unique (%) 2.8%
Missing 182590
Missing (%) 59.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 1998736
Mean 0.06633
Minimum 0
Maximum 1
Zeros 15600
Zeros (%) 5.1%
Negatives 0
Negatives (%) 0.0%
  • LANDAREA_AVG is skewed right (γ1 = 4.4586)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0192
Median 0.0486
Q3 0.0863
95-th Percentile 0.1971
Maximum 1
Range 1
IQR 0.0671

Descriptive Statistics

Mean 0.06633
Standard Deviation 0.08118
Variance 0.006591
Sum 8286.4077
Skewness 4.4586
Kurtosis 34.7434
Coefficient of Variation 1.2239
  • LANDAREA_AVG is not normally distributed (p-value 3.5053324240742713e-13)
  • LANDAREA_AVG has 6824 outliers

LIVINGAPARTMENTS_AVG

numerical

Approximate Distinct Count 1868
Approximate Unique (%) 1.9%
Missing 210199
Missing (%) 68.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 1556992
Mean 0.1008
Minimum 0
Maximum 1
Zeros 418
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • LIVINGAPARTMENTS_AVG is skewed right (γ1 = 3.0422)

Quantile Statistics

Minimum 0
5-th Percentile 0.0101
Q1 0.0504
Median 0.0756
Q3 0.121
95-th Percentile 0.2749
Maximum 1
Range 1
IQR 0.0706

Descriptive Statistics

Mean 0.1008
Standard Deviation 0.09258
Variance 0.00857
Sum 9806.5949
Skewness 3.0422
Kurtosis 16.4897
Coefficient of Variation 0.9186
  • LIVINGAPARTMENTS_AVG is not normally distributed (p-value 2.0040085424296816e-10)
  • LIVINGAPARTMENTS_AVG has 7881 outliers

LIVINGAREA_AVG

numerical

Approximate Distinct Count 5199
Approximate Unique (%) 3.4%
Missing 154350
Missing (%) 50.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 2450576
Mean 0.1074
Minimum 0
Maximum 1
Zeros 284
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • LIVINGAREA_AVG is skewed right (γ1 = 2.8547)

Quantile Statistics

Minimum 0
5-th Percentile 0.0084
Q1 0.0458
Median 0.0746
Q3 0.1312
95-th Percentile 0.3236
Maximum 1
Range 1
IQR 0.0854

Descriptive Statistics

Mean 0.1074
Standard Deviation 0.1106
Variance 0.01222
Sum 16449.3412
Skewness 2.8547
Kurtosis 12.3307
Coefficient of Variation 1.0295
  • LIVINGAREA_AVG is not normally distributed (p-value 1.7538728490615943e-09)
  • LIVINGAREA_AVG has 12262 outliers

NONLIVINGAPARTMENTS_AVG

numerical

Approximate Distinct Count 386
Approximate Unique (%) 0.4%
Missing 213514
Missing (%) 69.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 1503952
Mean 0.008809
Minimum 0
Maximum 1
Zeros 54549
Zeros (%) 17.7%
Negatives 0
Negatives (%) 0.0%
  • NONLIVINGAPARTMENTS_AVG is skewed right (γ1 = 15.5409)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.0051
95-th Percentile 0.0309
Maximum 1
Range 1
IQR 0.0051

Descriptive Statistics

Mean 0.008809
Standard Deviation 0.04773
Variance 0.002278
Sum 827.9888
Skewness 15.5409
Kurtosis 284.7151
Coefficient of Variation 5.4187
  • NONLIVINGAPARTMENTS_AVG is not normally distributed (p-value 4.810962253222664e-25)
  • NONLIVINGAPARTMENTS_AVG has 11754 outliers

NONLIVINGAREA_AVG

numerical

Approximate Distinct Count 3290
Approximate Unique (%) 2.4%
Missing 169682
Missing (%) 55.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 2205264
Mean 0.02836
Minimum 0
Maximum 1
Zeros 58735
Zeros (%) 19.1%
Negatives 0
Negatives (%) 0.0%
  • NONLIVINGAREA_AVG is skewed right (γ1 = 6.5589)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0.0037
Q3 0.0281
95-th Percentile 0.13
Maximum 1
Range 1
IQR 0.0281

Descriptive Statistics

Mean 0.02836
Standard Deviation 0.06952
Variance 0.004833
Sum 3908.5213
Skewness 6.5589
Kurtosis 64.91
Coefficient of Variation 2.4516
  • NONLIVINGAREA_AVG is not normally distributed (p-value 1.926001817071826e-24)
  • NONLIVINGAREA_AVG has 16317 outliers

APARTMENTS_MODE

numerical

Approximate Distinct Count 760
Approximate Unique (%) 0.5%
Missing 156061
Missing (%) 50.7%
Infinite 0
Infinite (%) 0.0%
Memory Size 2423200
Mean 0.1142
Minimum 0
Maximum 1
Zeros 976
Zeros (%) 0.3%
Negatives 0
Negatives (%) 0.0%
  • APARTMENTS_MODE is skewed right (γ1 = 2.703)

Quantile Statistics

Minimum 0
5-th Percentile 0.0084
Q1 0.0525
Median 0.084
Q3 0.146
95-th Percentile 0.3214
Maximum 1
Range 1
IQR 0.0935

Descriptive Statistics

Mean 0.1142
Standard Deviation 0.1079
Variance 0.01165
Sum 17300.286
Skewness 2.703
Kurtosis 11.7456
Coefficient of Variation 0.9449
  • APARTMENTS_MODE is not normally distributed (p-value 1.2576441354391088e-09)
  • APARTMENTS_MODE has 10007 outliers

BASEMENTAREA_MODE

numerical

Approximate Distinct Count 3841
Approximate Unique (%) 3.0%
Missing 179943
Missing (%) 58.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 2041088
Mean 0.08754
Minimum 0
Maximum 1
Zeros 16598
Zeros (%) 5.4%
Negatives 0
Negatives (%) 0.0%
  • BASEMENTAREA_MODE is skewed right (γ1 = 3.4815)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0409
Median 0.07495
Q3 0.1132
95-th Percentile 0.2295
Maximum 1
Range 1
IQR 0.0723

Descriptive Statistics

Mean 0.08754
Standard Deviation 0.08431
Variance 0.007108
Sum 11167.7125
Skewness 3.4815
Kurtosis 24.4302
Coefficient of Variation 0.963
  • BASEMENTAREA_MODE is not normally distributed (p-value 3.180779942733235e-09)
  • BASEMENTAREA_MODE has 6791 outliers

YEARS_BEGINEXPLUATATION_MODE

numerical

Approximate Distinct Count 221
Approximate Unique (%) 0.1%
Missing 150007
Missing (%) 48.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 2520064
Mean 0.9771
Minimum 0
Maximum 1
Zeros 142
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • YEARS_BEGINEXPLUATATION_MODE is skewed left (γ1 = -14.7552)

Quantile Statistics

Minimum 0
5-th Percentile 0.9682
Q1 0.9767
Median 0.9816
Q3 0.9866
95-th Percentile 0.996
Maximum 1
Range 1
IQR 0.0099

Descriptive Statistics

Mean 0.9771
Standard Deviation 0.06458
Variance 0.00417
Sum 153891.7045
Skewness -14.7552
Kurtosis 219.9557
Coefficient of Variation 0.06609
  • YEARS_BEGINEXPLUATATION_MODE is not normally distributed (p-value 6.826966263184605e-20)
  • YEARS_BEGINEXPLUATATION_MODE has 5074 outliers

YEARS_BUILD_MODE

numerical

Approximate Distinct Count 154
Approximate Unique (%) 0.1%
Missing 204488
Missing (%) 66.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 1648368
Mean 0.7596
Minimum 0
Maximum 1
Zeros 103
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • YEARS_BUILD_MODE is skewed left (γ1 = -1.0023)

Quantile Statistics

Minimum 0
5-th Percentile 0.608
Q1 0.6994
Median 0.7648
Q3 0.8236
95-th Percentile 0.9543
Maximum 1
Range 1
IQR 0.1242

Descriptive Statistics

Mean 0.7596
Standard Deviation 0.1101
Variance 0.01212
Sum 78260.1159
Skewness -1.0023
Kurtosis 4.7642
Coefficient of Variation 0.145
  • YEARS_BUILD_MODE is not normally distributed (p-value 0.003969181086751647)
  • YEARS_BUILD_MODE has 2537 outliers

COMMONAREA_MODE

numerical

Approximate Distinct Count 3128
Approximate Unique (%) 3.4%
Missing 214865
Missing (%) 69.9%
Infinite 0
Infinite (%) 0.0%
Memory Size 1482336
Mean 0.04255
Minimum 0
Maximum 1
Zeros 9690
Zeros (%) 3.2%
Negatives 0
Negatives (%) 0.0%
  • COMMONAREA_MODE is skewed right (γ1 = 5.6205)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0073
Median 0.0193
Q3 0.0494
95-th Percentile 0.1549
Maximum 1
Range 1
IQR 0.0421

Descriptive Statistics

Mean 0.04255
Standard Deviation 0.07444
Variance 0.005542
Sum 3942.378
Skewness 5.6205
Kurtosis 48.8583
Coefficient of Variation 1.7494
  • COMMONAREA_MODE is not normally distributed (p-value 1.2253707034244534e-21)
  • COMMONAREA_MODE has 7852 outliers

ELEVATORS_MODE

numerical

Approximate Distinct Count 26
Approximate Unique (%) 0.0%
Missing 163891
Missing (%) 53.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 2297920
Mean 0.07449
Minimum 0
Maximum 1
Zeros 89498
Zeros (%) 29.1%
Negatives 0
Negatives (%) 0.0%
  • ELEVATORS_MODE is skewed right (γ1 = 2.5523)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.1208
95-th Percentile 0.3222
Maximum 1
Range 1
IQR 0.1208

Descriptive Statistics

Mean 0.07449
Standard Deviation 0.1323
Variance 0.01749
Sum 10698.2159
Skewness 2.5523
Kurtosis 8.5976
Coefficient of Variation 1.7755
  • ELEVATORS_MODE is not normally distributed (p-value 3.240315932347155e-24)
  • ELEVATORS_MODE has 9732 outliers

ENTRANCES_MODE

numerical

Approximate Distinct Count 30
Approximate Unique (%) 0.0%
Missing 154828
Missing (%) 50.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 2442928
Mean 0.1452
Minimum 0
Maximum 1
Zeros 387
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • ENTRANCES_MODE is skewed right (γ1 = 2.3923)

Quantile Statistics

Minimum 0
5-th Percentile 0.0345
Q1 0.069
Median 0.1379
Q3 0.2069
95-th Percentile 0.3103
Maximum 1
Range 1
IQR 0.1379

Descriptive Statistics

Mean 0.1452
Standard Deviation 0.101
Variance 0.0102
Sum 22168.4507
Skewness 2.3923
Kurtosis 11.422
Coefficient of Variation 0.6955
  • ENTRANCES_MODE is not normally distributed (p-value 1.594454422444803e-11)
  • ENTRANCES_MODE has 3840 outliers

FLOORSMAX_MODE

numerical

Approximate Distinct Count 25
Approximate Unique (%) 0.0%
Missing 153020
Missing (%) 49.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 2471856
Mean 0.2223
Minimum 0
Maximum 1
Zeros 3415
Zeros (%) 1.1%
Negatives 0
Negatives (%) 0.0%
  • FLOORSMAX_MODE is skewed right (γ1 = 1.2443)

Quantile Statistics

Minimum 0
5-th Percentile 0.0417
Q1 0.1667
Median 0.1667
Q3 0.3333
95-th Percentile 0.4583
Maximum 1
Range 1
IQR 0.1666

Descriptive Statistics

Mean 0.2223
Standard Deviation 0.1437
Variance 0.02065
Sum 34345.674
Skewness 1.2443
Kurtosis 2.5356
Coefficient of Variation 0.6464
  • FLOORSMAX_MODE is not normally distributed (p-value 1.3773490250308146e-19)
  • FLOORSMAX_MODE has 5104 outliers

FLOORSMIN_MODE

numerical

Approximate Distinct Count 25
Approximate Unique (%) 0.0%
Missing 208642
Missing (%) 67.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 1581904
Mean 0.2281
Minimum 0
Maximum 1
Zeros 2517
Zeros (%) 0.8%
Negatives 0
Negatives (%) 0.0%
  • FLOORSMIN_MODE is skewed right (γ1 = 0.9638)

Quantile Statistics

Minimum 0
5-th Percentile 0.0417
Q1 0.0833
Median 0.2083
Q3 0.375
95-th Percentile 0.5
Maximum 1
Range 1
IQR 0.2917

Descriptive Statistics

Mean 0.2281
Standard Deviation 0.1612
Variance 0.02597
Sum 22547.9151
Skewness 0.9638
Kurtosis 1.3536
Coefficient of Variation 0.7067
  • FLOORSMIN_MODE is not normally distributed (p-value 7.399190557475883e-17)
  • FLOORSMIN_MODE has 320 outliers

LANDAREA_MODE

numerical

Approximate Distinct Count 3563
Approximate Unique (%) 2.9%
Missing 182590
Missing (%) 59.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 1998736
Mean 0.06496
Minimum 0
Maximum 1
Zeros 17453
Zeros (%) 5.7%
Negatives 0
Negatives (%) 0.0%
  • LANDAREA_MODE is skewed right (γ1 = 4.377)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.017
Median 0.0462
Q3 0.0847
95-th Percentile 0.1984
Maximum 1
Range 1
IQR 0.0677

Descriptive Statistics

Mean 0.06496
Standard Deviation 0.08175
Variance 0.006683
Sum 8114.5789
Skewness 4.377
Kurtosis 33.2719
Coefficient of Variation 1.2585
  • LANDAREA_MODE is not normally distributed (p-value 2.7820376143291507e-14)
  • LANDAREA_MODE has 6953 outliers

LIVINGAPARTMENTS_MODE

numerical

Approximate Distinct Count 736
Approximate Unique (%) 0.8%
Missing 210199
Missing (%) 68.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 1556992
Mean 0.1056
Minimum 0
Maximum 1
Zeros 519
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • LIVINGAPARTMENTS_MODE is skewed right (γ1 = 2.9026)

Quantile Statistics

Minimum 0
5-th Percentile 0.011
Q1 0.0542
Median 0.0771
Q3 0.1313
95-th Percentile 0.2966
Maximum 1
Range 1
IQR 0.0771

Descriptive Statistics

Mean 0.1056
Standard Deviation 0.09788
Variance 0.009581
Sum 10280.5123
Skewness 2.9026
Kurtosis 14.2242
Coefficient of Variation 0.9265
  • LIVINGAPARTMENTS_MODE is not normally distributed (p-value 1.3677934664892571e-09)
  • LIVINGAPARTMENTS_MODE has 7469 outliers

LIVINGAREA_MODE

numerical

Approximate Distinct Count 5301
Approximate Unique (%) 3.5%
Missing 154350
Missing (%) 50.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 2450576
Mean 0.106
Minimum 0
Maximum 1
Zeros 444
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • LIVINGAREA_MODE is skewed right (γ1 = 2.9025)

Quantile Statistics

Minimum 0
5-th Percentile 0.0081
Q1 0.0433
Median 0.0735
Q3 0.1262
95-th Percentile 0.3267
Maximum 1
Range 1
IQR 0.0829

Descriptive Statistics

Mean 0.106
Standard Deviation 0.1118
Variance 0.01251
Sum 16231.2447
Skewness 2.9025
Kurtosis 12.4586
Coefficient of Variation 1.0554
  • LIVINGAREA_MODE is not normally distributed (p-value 1.6524594136118725e-09)
  • LIVINGAREA_MODE has 12968 outliers

NONLIVINGAPARTMENTS_MODE

numerical

Approximate Distinct Count 167
Approximate Unique (%) 0.2%
Missing 213514
Missing (%) 69.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 1503952
Mean 0.008076
Minimum 0
Maximum 1
Zeros 59255
Zeros (%) 19.3%
Negatives 0
Negatives (%) 0.0%
  • NONLIVINGAPARTMENTS_MODE is skewed right (γ1 = 16.2516)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.0039
95-th Percentile 0.0272
Maximum 1
Range 1
IQR 0.0039

Descriptive Statistics

Mean 0.008076
Standard Deviation 0.04628
Variance 0.002141
Sum 759.1562
Skewness 16.2516
Kurtosis 309.7104
Coefficient of Variation 5.7298
  • NONLIVINGAPARTMENTS_MODE is not normally distributed (p-value 4.698991503072799e-25)
  • NONLIVINGAPARTMENTS_MODE has 14224 outliers

NONLIVINGAREA_MODE

numerical

Approximate Distinct Count 3327
Approximate Unique (%) 2.4%
Missing 169682
Missing (%) 55.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 2205264
Mean 0.02702
Minimum 0
Maximum 1
Zeros 67126
Zeros (%) 21.8%
Negatives 0
Negatives (%) 0.0%
  • NONLIVINGAREA_MODE is skewed right (γ1 = 6.5224)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0.0012
Q3 0.0233
95-th Percentile 0.1321
Maximum 1
Range 1
IQR 0.0233

Descriptive Statistics

Mean 0.02702
Standard Deviation 0.07025
Variance 0.004936
Sum 3724.4593
Skewness 6.5224
Kurtosis 63.3547
Coefficient of Variation 2.5998
  • NONLIVINGAREA_MODE is not normally distributed (p-value 1.2300077913343854e-24)
  • NONLIVINGAREA_MODE has 18704 outliers

APARTMENTS_MEDI

numerical

Approximate Distinct Count 1148
Approximate Unique (%) 0.8%
Missing 156061
Missing (%) 50.7%
Infinite 0
Infinite (%) 0.0%
Memory Size 2423200
Mean 0.1178
Minimum 0
Maximum 1
Zeros 771
Zeros (%) 0.3%
Negatives 0
Negatives (%) 0.0%
  • APARTMENTS_MEDI is skewed right (γ1 = 2.6392)

Quantile Statistics

Minimum 0
5-th Percentile 0.0094
Q1 0.0583
Median 0.088
Q3 0.1499
95-th Percentile 0.3331
Maximum 1
Range 1
IQR 0.0916

Descriptive Statistics

Mean 0.1178
Standard Deviation 0.1091
Variance 0.0119
Sum 17848.3705
Skewness 2.6392
Kurtosis 11.2418
Coefficient of Variation 0.9255
  • APARTMENTS_MEDI is not normally distributed (p-value 1.8083460016959982e-09)
  • APARTMENTS_MEDI has 10573 outliers

BASEMENTAREA_MEDI

numerical

Approximate Distinct Count 3772
Approximate Unique (%) 3.0%
Missing 179943
Missing (%) 58.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 2041088
Mean 0.08795
Minimum 0
Maximum 1
Zeros 14991
Zeros (%) 4.9%
Negatives 0
Negatives (%) 0.0%
  • BASEMENTAREA_MEDI is skewed right (γ1 = 3.553)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0441
Median 0.0764
Q3 0.1123
95-th Percentile 0.2258
Maximum 1
Range 1
IQR 0.0682

Descriptive Statistics

Mean 0.08795
Standard Deviation 0.08218
Variance 0.006753
Sum 11220.2249
Skewness 3.553
Kurtosis 25.8287
Coefficient of Variation 0.9343
  • BASEMENTAREA_MEDI is not normally distributed (p-value 2.7896403778183123e-09)
  • BASEMENTAREA_MEDI has 7084 outliers

YEARS_BEGINEXPLUATATION_MEDI

numerical

Approximate Distinct Count 245
Approximate Unique (%) 0.2%
Missing 150007
Missing (%) 48.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 2520064
Mean 0.9778
Minimum 0
Maximum 1
Zeros 548
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • YEARS_BEGINEXPLUATATION_MEDI is skewed left (γ1 = -15.573)

Quantile Statistics

Minimum 0
5-th Percentile 0.9687
Q1 0.9767
Median 0.9821
Q3 0.9866
95-th Percentile 0.996
Maximum 1
Range 1
IQR 0.0099

Descriptive Statistics

Mean 0.9778
Standard Deviation 0.0599
Variance 0.003588
Sum 153999.8926
Skewness -15.573
Kurtosis 248.3886
Coefficient of Variation 0.06126
  • YEARS_BEGINEXPLUATATION_MEDI is not normally distributed (p-value 3.9430548497683214e-20)
  • YEARS_BEGINEXPLUATATION_MEDI has 4762 outliers

YEARS_BUILD_MEDI

numerical

Approximate Distinct Count 151
Approximate Unique (%) 0.1%
Missing 204488
Missing (%) 66.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 1648368
Mean 0.7557
Minimum 0
Maximum 1
Zeros 101
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • YEARS_BUILD_MEDI is skewed left (γ1 = -0.9628)

Quantile Statistics

Minimum 0
5-th Percentile 0.6042
Q1 0.6914
Median 0.7585
Q3 0.8256
95-th Percentile 0.953
Maximum 1
Range 1
IQR 0.1342

Descriptive Statistics

Mean 0.7557
Standard Deviation 0.1121
Variance 0.01256
Sum 77859.2482
Skewness -0.9628
Kurtosis 4.4685
Coefficient of Variation 0.1483
  • YEARS_BUILD_MEDI is not normally distributed (p-value 0.004215922173767969)
  • YEARS_BUILD_MEDI has 2274 outliers

COMMONAREA_MEDI

numerical

Approximate Distinct Count 3202
Approximate Unique (%) 3.5%
Missing 214865
Missing (%) 69.9%
Infinite 0
Infinite (%) 0.0%
Memory Size 1482336
Mean 0.0446
Minimum 0
Maximum 1
Zeros 8691
Zeros (%) 2.8%
Negatives 0
Negatives (%) 0.0%
  • COMMONAREA_MEDI is skewed right (γ1 = 5.4192)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0079
Median 0.0212
Q3 0.0516
95-th Percentile 0.1627
Maximum 1
Range 1
IQR 0.0437

Descriptive Statistics

Mean 0.0446
Standard Deviation 0.07614
Variance 0.005798
Sum 4131.5578
Skewness 5.4192
Kurtosis 45.2611
Coefficient of Variation 1.7075
  • COMMONAREA_MEDI is not normally distributed (p-value 3.2699570632518226e-21)
  • COMMONAREA_MEDI has 7923 outliers

ELEVATORS_MEDI

numerical

Approximate Distinct Count 46
Approximate Unique (%) 0.0%
Missing 163891
Missing (%) 53.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 2297920
Mean 0.07808
Minimum 0
Maximum 1
Zeros 87026
Zeros (%) 28.3%
Negatives 0
Negatives (%) 0.0%
  • ELEVATORS_MEDI is skewed right (γ1 = 2.4578)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.12
95-th Percentile 0.36
Maximum 1
Range 1
IQR 0.12

Descriptive Statistics

Mean 0.07808
Standard Deviation 0.1345
Variance 0.01808
Sum 11213.54
Skewness 2.4578
Kurtosis 7.9637
Coefficient of Variation 1.7222
  • ELEVATORS_MEDI is not normally distributed (p-value 3.093195803492825e-24)
  • ELEVATORS_MEDI has 10383 outliers

ENTRANCES_MEDI

numerical

Approximate Distinct Count 46
Approximate Unique (%) 0.0%
Missing 154828
Missing (%) 50.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 2442928
Mean 0.1492
Minimum 0
Maximum 1
Zeros 329
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • ENTRANCES_MEDI is skewed right (γ1 = 2.3877)

Quantile Statistics

Minimum 0
5-th Percentile 0.0345
Q1 0.069
Median 0.1379
Q3 0.2069
95-th Percentile 0.3103
Maximum 1
Range 1
IQR 0.1379

Descriptive Statistics

Mean 0.1492
Standard Deviation 0.1004
Variance 0.01007
Sum 22782.255
Skewness 2.3877
Kurtosis 11.4737
Coefficient of Variation 0.6727
  • ENTRANCES_MEDI is not normally distributed (p-value 3.835298165803132e-12)
  • ENTRANCES_MEDI has 3893 outliers

FLOORSMAX_MEDI

numerical

Approximate Distinct Count 49
Approximate Unique (%) 0.0%
Missing 153020
Missing (%) 49.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 2471856
Mean 0.2259
Minimum 0
Maximum 1
Zeros 2995
Zeros (%) 1.0%
Negatives 0
Negatives (%) 0.0%
  • FLOORSMAX_MEDI is skewed right (γ1 = 1.2402)

Quantile Statistics

Minimum 0
5-th Percentile 0.0417
Q1 0.1667
Median 0.1667
Q3 0.3333
95-th Percentile 0.5
Maximum 1
Range 1
IQR 0.1666

Descriptive Statistics

Mean 0.2259
Standard Deviation 0.1451
Variance 0.02104
Sum 34898.9901
Skewness 1.2402
Kurtosis 2.4684
Coefficient of Variation 0.6422
  • FLOORSMAX_MEDI is not normally distributed (p-value 1.1814747597073758e-19)
  • FLOORSMAX_MEDI has 5360 outliers

FLOORSMIN_MEDI

numerical

Approximate Distinct Count 47
Approximate Unique (%) 0.0%
Missing 208642
Missing (%) 67.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 1581904
Mean 0.2316
Minimum 0
Maximum 1
Zeros 2351
Zeros (%) 0.8%
Negatives 0
Negatives (%) 0.0%
  • FLOORSMIN_MEDI is skewed right (γ1 = 0.9602)

Quantile Statistics

Minimum 0
5-th Percentile 0.0417
Q1 0.0833
Median 0.2083
Q3 0.375
95-th Percentile 0.5
Maximum 1
Range 1
IQR 0.2917

Descriptive Statistics

Mean 0.2316
Standard Deviation 0.1619
Variance 0.02622
Sum 22900.526
Skewness 0.9602
Kurtosis 1.3464
Coefficient of Variation 0.6991
  • FLOORSMIN_MEDI is not normally distributed (p-value 5.331066817129837e-17)
  • FLOORSMIN_MEDI has 344 outliers

LANDAREA_MEDI

numerical

Approximate Distinct Count 3560
Approximate Unique (%) 2.9%
Missing 182590
Missing (%) 59.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 1998736
Mean 0.06717
Minimum 0
Maximum 1
Zeros 15919
Zeros (%) 5.2%
Negatives 0
Negatives (%) 0.0%
  • LANDAREA_MEDI is skewed right (γ1 = 4.3682)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0193
Median 0.049
Q3 0.0874
95-th Percentile 0.2
Maximum 1
Range 1
IQR 0.0681

Descriptive Statistics

Mean 0.06717
Standard Deviation 0.08217
Variance 0.006751
Sum 8390.7873
Skewness 4.3682
Kurtosis 33.2366
Coefficient of Variation 1.2233
  • LANDAREA_MEDI is not normally distributed (p-value 3.631301953256737e-13)
  • LANDAREA_MEDI has 6833 outliers

LIVINGAPARTMENTS_MEDI

numerical

Approximate Distinct Count 1097
Approximate Unique (%) 1.1%
Missing 210199
Missing (%) 68.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 1556992
Mean 0.102
Minimum 0
Maximum 1
Zeros 433
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • LIVINGAPARTMENTS_MEDI is skewed right (γ1 = 2.9882)

Quantile Statistics

Minimum 0
5-th Percentile 0.0103
Q1 0.0513
Median 0.077
Q3 0.1231
95-th Percentile 0.2805
Maximum 1
Range 1
IQR 0.0718

Descriptive Statistics

Mean 0.102
Standard Deviation 0.09364
Variance 0.008769
Sum 9921.3937
Skewness 2.9882
Kurtosis 15.6962
Coefficient of Variation 0.9185
  • LIVINGAPARTMENTS_MEDI is not normally distributed (p-value 2.418475288217974e-10)
  • LIVINGAPARTMENTS_MEDI has 7927 outliers

LIVINGAREA_MEDI

numerical

Approximate Distinct Count 5281
Approximate Unique (%) 3.4%
Missing 154350
Missing (%) 50.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 2450576
Mean 0.1086
Minimum 0
Maximum 1
Zeros 299
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • LIVINGAREA_MEDI is skewed right (γ1 = 2.8489)

Quantile Statistics

Minimum 0
5-th Percentile 0.0085
Q1 0.0463
Median 0.0749
Q3 0.1318
95-th Percentile 0.3294
Maximum 1
Range 1
IQR 0.0855

Descriptive Statistics

Mean 0.1086
Standard Deviation 0.1123
Variance 0.0126
Sum 16634.3163
Skewness 2.8489
Kurtosis 12.1381
Coefficient of Variation 1.0336
  • LIVINGAREA_MEDI is not normally distributed (p-value 1.2743287365273831e-09)
  • LIVINGAREA_MEDI has 12609 outliers

NONLIVINGAPARTMENTS_MEDI

numerical

Approximate Distinct Count 214
Approximate Unique (%) 0.2%
Missing 213514
Missing (%) 69.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 1503952
Mean 0.008651
Minimum 0
Maximum 1
Zeros 56097
Zeros (%) 18.2%
Negatives 0
Negatives (%) 0.0%
  • NONLIVINGAPARTMENTS_MEDI is skewed right (γ1 = 15.6717)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.0039
95-th Percentile 0.0311
Maximum 1
Range 1
IQR 0.0039

Descriptive Statistics

Mean 0.008651
Standard Deviation 0.04741
Variance 0.002248
Sum 813.1693
Skewness 15.6717
Kurtosis 289.477
Coefficient of Variation 5.4808
  • NONLIVINGAPARTMENTS_MEDI is not normally distributed (p-value 4.786582651414071e-25)
  • NONLIVINGAPARTMENTS_MEDI has 15215 outliers

NONLIVINGAREA_MEDI

numerical

Approximate Distinct Count 3323
Approximate Unique (%) 2.4%
Missing 169682
Missing (%) 55.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 2205264
Mean 0.02824
Minimum 0
Maximum 1
Zeros 60954
Zeros (%) 19.8%
Negatives 0
Negatives (%) 0.0%
  • NONLIVINGAREA_MEDI is skewed right (γ1 = 6.5088)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0.0032
Q3 0.0269
95-th Percentile 0.1316
Maximum 1
Range 1
IQR 0.0269

Descriptive Statistics

Mean 0.02824
Standard Deviation 0.07017
Variance 0.004923
Sum 3891.7287
Skewness 6.5088
Kurtosis 63.6517
Coefficient of Variation 2.485
  • NONLIVINGAREA_MEDI is not normally distributed (p-value 1.6700961249898202e-24)
  • NONLIVINGAREA_MEDI has 17065 outliers

FONDKAPREMONT_MODE

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 210295
Missing (%) 68.4%
Memory Size 7917835
  • The largest value (reg oper account) is over 6.11 times larger than the second largest value (reg oper spec account)

Length

Mean 16.4458
Standard Deviation 1.8532
Median 16
Minimum 13
Maximum 21

Sample

1st row reg oper account
2nd row reg oper account
3rd row reg oper account
4th row reg oper account
5th row reg oper account

Letter

Count 1397970
Lowercase Letter 1397970
Space Separator 200825
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
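
The labels in the Letter block (Lowercase Letter, Space Separator, Dash Punctuation, Decimal Number) match the names of Unicode general categories. The report does not show how the counts are produced, so the following is only a sketch under that assumption: it uses Python's `unicodedata.category` on the five sample rows shown above. The helper name `char_class_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

# Unicode general-category codes mapped to the labels used in the report.
LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}


def char_class_counts(values):
    """Count characters per Unicode category across all strings in `values`."""
    counts = Counter({label: 0 for label in LABELS.values()})
    for value in values:
        for ch in value:
            label = LABELS.get(unicodedata.category(ch))
            if label is not None:
                counts[label] += 1
    return counts


# The five sample rows listed for FONDKAPREMONT_MODE.
sample = ["reg oper account"] * 5
counts = char_class_counts(sample)
lengths = [len(v) for v in sample]

print(dict(counts))
print(lengths)
```

Each sample value has length 16, which agrees with the median length reported for this column.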

HOUSETYPE_MODE

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 154297
Missing (%) 50.2%
Memory Size 12106904
  • The largest value (block of flats) is over 100.4 times larger than the second largest value (specific housing)

Length

Mean 14.0196
Standard Deviation 0.1969
Median 14
Minimum 14
Maximum 16

Sample

1st row block of flats
2nd row block of flats
3rd row block of flats
4th row block of flats
5th row block of flats

Letter

Count 1844277
Lowercase Letter 1844277
Space Separator 303717
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0

TOTALAREA_MODE

numerical

Approximate Distinct Count 5116
Approximate Unique (%) 3.2%
Missing 148431
Missing (%) 48.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 2545280
Mean 0.1025
Minimum 0
Maximum 1
Zeros 582
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • TOTALAREA_MODE is skewed right (γ1 = 2.7975)

Quantile Statistics

Minimum 0
5-th Percentile 0.0068
Q1 0.0415
Median 0.0691
Q3 0.1292
95-th Percentile 0.3116
Maximum 1
Range 1
IQR 0.0877

Descriptive Statistics

Mean 0.1025
Standard Deviation 0.1075
Variance 0.01155
Sum 16313.1231
Skewness 2.7975
Kurtosis 12.1671
Coefficient of Variation 1.0479
  • TOTALAREA_MODE is not normally distributed (p-value 8.23179296045319e-11)
  • TOTALAREA_MODE has 11765 outliers

WALLSMATERIAL_MODE

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 156341
Missing (%) 50.8%
Memory Size 11051487

Length

Mean 8.1064
Standard Deviation 3.4205
Median 5
Minimum 5
Maximum 12

Sample

1st row Stone, brick
2nd row Block
3rd row Panel
4th row Panel
5th row Stone, brick

Letter

Count 1095807
Lowercase Letter 944637
Space Separator 64815
Uppercase Letter 151170
Dash Punctuation 0
Decimal Number 0

EMERGENCYSTATE_MODE

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 145755
Missing (%) 47.4%
Memory Size 10839980
  • The largest value (No) is over 68.48 times larger than the second largest value (Yes)

Length

Mean 2.0144
Standard Deviation 0.1191
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 325840
Lowercase Letter 164084
Space Separator 0
Uppercase Letter 161756
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%

OBS_30_CNT_SOCIAL_CIRCLE

numerical

Approximate Distinct Count 33
Approximate Unique (%) 0.0%
Missing 1021
Missing (%) 0.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 4903840
Mean 1.4222
Minimum 0
Maximum 348
Zeros 163910
Zeros (%) 53.3%
Negatives 0
Negatives (%) 0.0%
  • OBS_30_CNT_SOCIAL_CIRCLE is skewed right (γ1 = 12.1395)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 2
95-th Percentile 6
Maximum 348
Range 348
IQR 2

Descriptive Statistics

Mean 1.4222
Standard Deviation 2.401
Variance 5.7647
Sum 435904
Skewness 12.1395
Kurtosis 1424.7923
Coefficient of Variation 1.6882
  • OBS_30_CNT_SOCIAL_CIRCLE is not normally distributed (p-value 4.6762275543780575e-25)
  • OBS_30_CNT_SOCIAL_CIRCLE has 19971 outliers
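
The report does not state which rule it uses to count outliers. One common convention, assumed here purely for illustration, is Tukey's fences: a value is an outlier if it lies more than 1.5 × IQR below Q1 or above Q3. The sketch applies that rule to the quartiles printed for OBS_30_CNT_SOCIAL_CIRCLE. The sample values are made up and the function names are hypothetical.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def count_outliers(values, q1, q3, k=1.5):
    """Count values that fall strictly outside the Tukey fences."""
    lower, upper = tukey_fences(q1, q3, k)
    return sum(1 for v in values if v < lower or v > upper)


# Quartiles from the OBS_30_CNT_SOCIAL_CIRCLE block: Q1 = 0, Q3 = 2.
lower, upper = tukey_fences(0, 2)

# Made-up values for illustration only; 6 and 348 exceed the upper fence.
sample = [0, 0, 1, 2, 6, 348]
n_outliers = count_outliers(sample, 0, 2)

print(lower, upper, n_outliers)
```

Under this assumed rule the upper fence is 5, below the reported 95-th percentile of 6, so more than 5% of non-missing values would be flagged. The reported count of 19971 fits that, although this does not confirm the rule the tool actually applies.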

DEF_30_CNT_SOCIAL_CIRCLE

numerical

Approximate Distinct Count 10
Approximate Unique (%) 0.0%
Missing 1021
Missing (%) 0.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 4903840
Mean 0.1434
Minimum 0
Maximum 34
Zeros 271324
Zeros (%) 88.2%
Negatives 0
Negatives (%) 0.0%
  • DEF_30_CNT_SOCIAL_CIRCLE is skewed right (γ1 = 5.1835)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 1
Maximum 34
Range 34
IQR 0

Descriptive Statistics

Mean 0.1434
Standard Deviation 0.4467
Variance 0.1995
Sum 43957
Skewness 5.1835
Kurtosis 126.3104
Coefficient of Variation 3.1146
  • DEF_30_CNT_SOCIAL_CIRCLE is not normally distributed (p-value 7.602340785170992e-25)
  • DEF_30_CNT_SOCIAL_CIRCLE has 35166 outliers

OBS_60_CNT_SOCIAL_CIRCLE

numerical

Approximate Distinct Count 33
Approximate Unique (%) 0.0%
Missing 1021
Missing (%) 0.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 4903840
Mean 1.4053
Minimum 0
Maximum 344
Zeros 164666
Zeros (%) 53.5%
Negatives 0
Negatives (%) 0.0%
  • OBS_60_CNT_SOCIAL_CIRCLE is skewed right (γ1 = 12.0708)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 2
95-th Percentile 6
Maximum 344
Range 344
IQR 2

Descriptive Statistics

Mean 1.4053
Standard Deviation 2.3798
Variance 5.6635
Sum 430708
Skewness 12.0708
Kurtosis 1409.6815
Coefficient of Variation 1.6935
  • OBS_60_CNT_SOCIAL_CIRCLE is not normally distributed (p-value 4.655631872232157e-25)
  • OBS_60_CNT_SOCIAL_CIRCLE has 19564 outliers

DEF_60_CNT_SOCIAL_CIRCLE

numerical

Approximate Distinct Count 9
Approximate Unique (%) 0.0%
Missing 1021
Missing (%) 0.3%
Infinite 0
Infinite (%) 0.0%
Memory Size 4903840
Mean 0.1
Minimum 0
Maximum 24
Zeros 280721
Zeros (%) 91.3%
Negatives 0
Negatives (%) 0.0%
  • DEF_60_CNT_SOCIAL_CIRCLE is skewed right (γ1 = 5.2779)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 1
Maximum 24
Range 24
IQR 0

Descriptive Statistics

Mean 0.1
Standard Deviation 0.3623
Variance 0.1313
Sum 30664
Skewness 5.2779
Kurtosis 86.5614
Coefficient of Variation 3.6211
  • DEF_60_CNT_SOCIAL_CIRCLE is not normally distributed (p-value 5.860143083307702e-25)
  • DEF_60_CNT_SOCIAL_CIRCLE has 25769 outliers

DAYS_LAST_PHONE_CHANGE

numerical

Approximate Distinct Count 3773
Approximate Unique (%) 1.2%
Missing 1
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 4920160
Mean -962.8588
Minimum -4292
Maximum 0
Zeros 37672
Zeros (%) 12.2%
Negatives 269838
Negatives (%) 87.8%
  • DAYS_LAST_PHONE_CHANGE is skewed left (γ1 = -0.7136)

Quantile Statistics

Minimum -4292
5-th Percentile -2517
Q1 -1565
Median -753
Q3 -269
95-th Percentile 0
Maximum 0
Range 4292
IQR 1296

Descriptive Statistics

Mean -962.8588
Standard Deviation 826.8085
Variance 683612.2742
Sum -2.9609e+08
Skewness -0.7136
Kurtosis -0.3086
Coefficient of Variation -0.8587
  • DAYS_LAST_PHONE_CHANGE is not normally distributed (p-value 2.8863704973640824e-17)
  • DAYS_LAST_PHONE_CHANGE has 449 outliers

FLAG_DOCUMENT_2

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 23653.69 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_2 has words of constant length

FLAG_DOCUMENT_3

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (1) is over 2.45 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 0
4th row 1
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (1, 0) take over 50.0%
  • FLAG_DOCUMENT_3 has words of constant length

FLAG_DOCUMENT_4

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 12299.44 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_4 has words of constant length

FLAG_DOCUMENT_5

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 65.16 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_5 has words of constant length

FLAG_DOCUMENT_6

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 10.36 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_6 has words of constant length

FLAG_DOCUMENT_7

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 5211.05 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_7 has words of constant length

FLAG_DOCUMENT_8

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 11.29 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_8 has words of constant length

FLAG_DOCUMENT_9

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 255.69 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_9 has words of constant length

FLAG_DOCUMENT_10

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 43929.14 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_10 has words of constant length

FLAG_DOCUMENT_11

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 254.62 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_11 has words of constant length

FLAG_DOCUMENT_12

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 153754.5 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_12 has words of constant length
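
The "times larger" insight reads as the ratio of the two most frequent category counts. The report does not print those counts, but for a column with exactly two categories and no missing values they can be recovered from the ratio and the row total. The sketch below does this for FLAG_DOCUMENT_12, assuming the ratio is majority count divided by minority count and that the distinct count of 2 is exact rather than approximate.

```python
# Figures from the report: total rows and the ratio stated for FLAG_DOCUMENT_12.
total_rows = 307511
ratio = 153754.5             # "largest value (0) is over 153754.5 times larger"

# With two categories: majority + minority = total and majority / minority = ratio,
# so minority = total / (ratio + 1).
minority = round(total_rows / (ratio + 1))
majority = total_rows - minority

print(majority, minority, majority / minority)
```

Under these assumptions only two rows carry the value 1, which suggests the column holds almost no information for modelling.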

FLAG_DOCUMENT_13

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 282.68 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_13 has words of constant length

FLAG_DOCUMENT_14

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 339.54 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_14 has words of constant length

FLAG_DOCUMENT_15

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 825.64 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_15 has words of constant length

FLAG_DOCUMENT_16

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 99.72 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_16 has words of constant length

FLAG_DOCUMENT_17

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 3749.13 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_17 has words of constant length

FLAG_DOCUMENT_18

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 122.0 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_18 has words of constant length

FLAG_DOCUMENT_19

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 1679.39 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_19 has words of constant length

FLAG_DOCUMENT_20

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 1970.22 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_20 has words of constant length

FLAG_DOCUMENT_21

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 20295726
  • The largest value (0) is over 2984.54 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 307511
  • The top 2 categories (0, 1) take over 50.0%
  • FLAG_DOCUMENT_21 has words of constant length

AMT_REQ_CREDIT_BUREAU_HOUR

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 41519
Missing (%) 13.5%
Memory Size 18087456
  • The largest value (0.0) is over 169.47 times larger than the second largest value (1.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 0.0
2nd row 0.0
3rd row 0.0
4th row 0.0
5th row 0.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 531984
  • The top 2 categories (0.0, 1.0) take over 50.0%
  • AMT_REQ_CREDIT_BUREAU_HOUR has words of constant length

AMT_REQ_CREDIT_BUREAU_DAY

numerical

Approximate Distinct Count 9
Approximate Unique (%) 0.0%
Missing 41519
Missing (%) 13.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 4255872
Mean 0.007
Minimum 0
Maximum 9
Zeros 264503
Zeros (%) 86.0%
Negatives 0
Negatives (%) 0.0%
  • AMT_REQ_CREDIT_BUREAU_DAY is skewed right (γ1 = 27.0434)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 9
Range 9
IQR 0

Descriptive Statistics

Mean 0.007
Standard Deviation 0.1108
Variance 0.01227
Sum 1862
Skewness 27.0434
Kurtosis 1151.8459
Coefficient of Variation 15.822
  • AMT_REQ_CREDIT_BUREAU_DAY is not normally distributed (p-value 4.232091108865669e-25)

AMT_REQ_CREDIT_BUREAU_WEEK

numerical

Approximate Distinct Count 9
Approximate Unique (%) 0.0%
Missing 41519
Missing (%) 13.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 4255872
Mean 0.03436
Minimum 0
Maximum 8
Zeros 257456
Zeros (%) 83.7%
Negatives 0
Negatives (%) 0.0%
  • AMT_REQ_CREDIT_BUREAU_WEEK is skewed right (γ1 = 9.2935)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 8
Range 8
IQR 0

Descriptive Statistics

Mean 0.03436
Standard Deviation 0.2047
Variance 0.0419
Sum 9140
Skewness 9.2935
Kurtosis 166.7491
Coefficient of Variation 5.9567
  • AMT_REQ_CREDIT_BUREAU_WEEK is not normally distributed (p-value 4.466906088934114e-25)
  • AMT_REQ_CREDIT_BUREAU_WEEK has 8536 outliers

AMT_REQ_CREDIT_BUREAU_MON

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.0%
Missing 41519
Missing (%) 13.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 4255872
Mean 0.2674
Minimum 0
Maximum 27
Zeros 222233
Zeros (%) 72.3%
Negatives 0
Negatives (%) 0.0%
  • AMT_REQ_CREDIT_BUREAU_MON is skewed right (γ1 = 7.8048)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 1
Maximum 27
Range 27
IQR 0

Descriptive Statistics

Mean 0.2674
Standard Deviation 0.916
Variance 0.8391
Sum 71125
Skewness 7.8048
Kurtosis 90.4331
Coefficient of Variation 3.4256
  • AMT_REQ_CREDIT_BUREAU_MON is not normally distributed (p-value 1.3394314799833658e-24)
  • AMT_REQ_CREDIT_BUREAU_MON has 43759 outliers

AMT_REQ_CREDIT_BUREAU_QRT

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 41519
Missing (%) 13.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 4255872
Mean 0.2655
Minimum 0
Maximum 261
Zeros 215417
Zeros (%) 70.0%
Negatives 0
Negatives (%) 0.0%
  • AMT_REQ_CREDIT_BUREAU_QRT is skewed right (γ1 = 134.365)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 2
Maximum 261
Range 261
IQR 0

Descriptive Statistics

Mean 0.2655
Standard Deviation 0.7941
Variance 0.6305
Sum 70614
Skewness 134.365
Kurtosis 43706.6431
Coefficient of Variation 2.9911
  • AMT_REQ_CREDIT_BUREAU_QRT is not normally distributed (p-value 4.2265198917624505e-25)
  • AMT_REQ_CREDIT_BUREAU_QRT has 50575 outliers
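
A maximum of 261 against a 95th percentile of 2 suggests that a small number of extreme values may be driving the very large skewness and kurtosis reported for this column. The sketch below shows the effect on synthetic data only. It assumes the population (biased) moment estimator for skewness, which may differ from the estimator the report uses, and the `skewness` helper is hypothetical.

```python
def skewness(values):
    """Population skewness: third central moment over variance**1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Synthetic zero-heavy counts (not the real column).
base = [0] * 70 + [1] * 20 + [2] * 10
with_extreme = base + [261]        # one extreme value, echoing the reported maximum

skew_base = skewness(base)
skew_extreme = skewness(with_extreme)

print(f"without extreme: {skew_base:.2f}, with extreme: {skew_extreme:.2f}")
```

If the real column behaves similarly, inspecting or capping the largest values before modelling would be worth considering, since moment-based statistics are highly sensitive to them.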

AMT_REQ_CREDIT_BUREAU_YEAR

numerical

Approximate Distinct Count 25
Approximate Unique (%) 0.0%
Missing 41519
Missing (%) 13.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 4255872
Mean 1.9
Minimum 0
Maximum 25
Zeros 71801
Zeros (%) 23.4%
Negatives 0
Negatives (%) 0.0%
  • AMT_REQ_CREDIT_BUREAU_YEAR is skewed right (γ1 = 1.2436)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 1
Q3 3
95-th Percentile 6
Maximum 25
Range 25
IQR 3

Descriptive Statistics

Mean 1.9
Standard Deviation 1.8693
Variance 3.4943
Sum 505378
Skewness 1.2436
Kurtosis 1.969
Coefficient of Variation 0.9839
  • AMT_REQ_CREDIT_BUREAU_YEAR is not normally distributed (p-value 4.076666063254506e-13)
  • AMT_REQ_CREDIT_BUREAU_YEAR has 3364 outliers

Interactions

Correlations

Missing Values